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Abstract 

We describe a novel technique to characterize ultrashort electron bxmches in X- 
ray Free-Electron Lasers. Namely, we propose to use coherent Optical Transition 
Radiation to measure three-dimensional (3D) electron density distributions. Our 
method relies on the combination of two known diagnostics setups, an Optical 
Replica Synthesizer (ORS) and an Optical Transition Radiation (OTR) imager. Elec- 
tron bunches are modulated at optical wavelengths in the ORS setup. When these 
electron bunches pass through a metal foil target, coherent radiation pulses of 
tens MW power are generated. It is thereafter possible to exploit advantages of 
coherent imaging techniques, such as direct imaging, diffractive imaging, Fourier 
holography and their combinations. The proposed method opens up the possibility 
of real-time, wavelength-limited, single-shot 3D imaging of an ultrashort electron 
bunch. 
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1 Introduction 



Three X-ray Free-Electron Lasers (XFELs), LCLS [1], SCSS |0], and the Eu- 
ropean XFEL ll^ are currently under commissioning or under construction. 
These machines are based on the Self-Amplified Spontaneous Emission 
(SASE) process and will be operating with electron bunch durations 

of less than 100 fs. 

Operational success of XFELs will be related to the ability of monitoring 
the spatio-temporal structure of these sub-100 fs electron bunches as they 
travel along the XFEL structure. However, the femtosecond time-scale is 
beyond the scale of standard electronic display instrumentation. Therefore, 
the development of methods for characterizing such short electron bunches 
both in the longitudinal and in the transverse directions is a high-priority 
task, which is very challenging. 

A method for peak-current shape measurements of ultrashort electron bunches 
was proposed in ||^. It uses the undulator-based Optical Replica Synthe- 
sizer (ORS), together with the ultrashort laser pulse shape measurement 
technique called Frequency-Resolved Optical Gating (FROG) [9]. It was 
demonstrated in [8] that the peak-current profile for a single, ultrashort 
electron bunch could be determined with a resolution of a few femtosec- 
onds. The ORS method is currently being tested at the Free-electron laser in 



Hamburg (FLASH) HlOy. 



Recently, feasibility studies for integrating the ORS diagnostics setup with a 
timing scheme for pump-probe experiments and with a scheme for output- 
power stabilization of an X-ray SASE EEL were presented in |12(1 and 113(1 . 
Both schemes rely on an external optical laser to modulate the energy of 
an electron bunch in a short undulator. Such energy-modulation is subse- 
quently converted into density modulation by means of a dispersive section. 
Since the ORS setup already includes seed laser, energy modulator undula- 
tor and dispersion section, it is the most natural option to be considered in 
the implementation of both timing and stabilization schemes. 

In this paper we present a feasibility study for integrating the ORS setup 
with a high-resolution electron bunch imager based on coherent Optical 
Transition Radiation (OTR). 

Electron bunch imagers based on incoherent OTR constitute the main device 
presently available for the characterization of an ultrashort electron bunch 
in the transverse direction. They work by measuring the transverse intensity 
distribution. However, since no fast enough detector is presently available, 
the image is actually integrated over the duration of the electron bunch. 
Therefore, incoherent OTR imagers fail to measure the temporal dependence 
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of the charge density distribution within the bunch. For these reasons, the 
use of standard incoherent OTR imagers is limited to transverse electron- 
beam diagnostics, to measure e.g. the projected transverse emittance of 
electrons. 

However, it is primarily the emittance of electrons in short axial slice^^r 
which determines the performance of an XFEL. There is, therefore, a com- 
pelling need for the development of electron diagnostics capable of mea- 
suring three-dimensional (3D) ultrashort electron bunch structures with 
micron-level resolution. 

The main advantages of coherent OTR imaging with respect to the usual 
incoherent OTR imaging is in the coherence of the radiation pulse, and in 
the high photon flux. Exploitation of these advantages lead to applications 
of coherent OTR imaging that are not confined to diagnostics of the trans- 
verse distribution of electrons. The novel diagnostic techniques described 
here can indeed be used to determine the 3D distribution of electrons in a 
ultrashort single bunch. In combination with multi-shot measurements and 
quadrupole scans, they can also be used to determine the electron bunch 
slice emittance. 

The possibility of single-shot, 3D imaging of electron bunches with mi- 
croscale resolution makes coherent OTR imaging an ideal on-line tool for 
aligning the bunch formation system at XFELs. 

Future XFEL operation will set tight tolerances on electron bunch trajecto- 
ries. They need to be carefully monitored along the full-length of the ma- 
chine. In order to ensure SASE lasing at X-ray wavelengths, a very high orbit 
accuracy of a few microns has to be ensured in the 200 m long undulator. 
The resolution of incoherent OTR imagers is not adequate to characterize 
the position of the center of gravity of an electron bunch with such accuracy. 
Our studies show that coherent OTR imaging can be utilized as an effec- 
tive tool for measuring the absolute position of the electron bunch with the 
required micron accuracy. 

Finally, the improvement of bunch-imaging techniques up to the microscale 
level does not only yield a powerful diagnostic tool, but opens up new 
possibilities in XFEL technology as well. 

A pioneering experiment for integrating the ORS setup with a coherent OTR 
imager was performed at FLASH. The energy of the coherent OTR pulse 
was measured as a function of the position of a relatively short seed laser 
pulse along the (long) bunch, i.e. the slice peak-current was measure . First 

^ These slices are only a fraction of the full (100 fs) bunch length. 

^ The electron bunch was not compressed at the time of those measurements. 
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attempts to extract information about the correlation between longitudinal 
and transverse distribution also took place. These results, which should be 
considered as first steps into a novel direction of electron-beam diagnostics, 
are reported in [lUJ. 

In this work, we illustrate the potential of the proposed coherent OTR im- 
ager scheme for the case of the European XFEL [3]. We show that it naturally 
fits into the project. Technical realization will be straightforward and cost- 
effective, since it is essentially based on technical components (ORS, OTR 
diagnostics stations), which are already included in the design of the Euro- 
pean XFEL. In our analysis we considered the baseline parameters, so that 
our scheme can be implemented at the very first stage of operation of the 
European XFEL facility. However, the applicability of our method is not 
restricted to the European XFEL setup. Other projects, e.g. LCLS or SCSS 
may benefit from this work as well. 

Our paper is organized as follows. In the next Sections 2, 3 and 4 we in- 
troduce basic concepts and important details pertaining the Optical Replica 
method, the coherent OTR generation and the image formation of a modu- 
lated electron bunch. 



In Section 5 we analyze a coherent OTR imaging setup, the so called 4/ fil- 
tering architecture, which is relatively simple to implement experimentally. 
We demonstrate that such setup can be used to characterize electron den- 
sity profiles on the microscale level. Such resolution level can be reached by 
taking advantage of a number of options, including e.g. spatial filtering in 
the Fourier plane and radial-to-linear polarization conversion, which make 
coherent OTR imaging more accurate. 

Diffractive imaging is one of the most promising techniques for microscale 
imaging of electron bunches, when a detector records the Fraunhof er diffrac- 
tion pattern radiated by the electron bunch. Subsequently, an image can be 
reconstructed with the help of a phase retrieval algorithm. This method 
reduces the requirements on the optical hardware by increasing the sophis- 
tication in the post-processing of the data collected by the system. Besides, 
a diffractive imaging setup has the same ultimate resolution of the 4/ co- 
herent imaging setup. This extremely simple method is discussed for two 
versions, with and without the use of lenses, in Section 6. 

Fourier-Transform Holography (FTH) is analyzed as another promising 
imaging method in Section 7. FTH is a non-iterative imaging technique, 
so the image can be reconstructed in a single step deterministic computa- 
tion. This is achieved by placing a coherent point source at an appropriate 
distance from the object and having the object field interfering with the 
reference wave produced by this point source, detecting the interference 
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pattern in the Fourier plane. For optical applications, the resolution of holo- 
graphic techniques is not limited by size and quality of the point-like source. 
With the help of modem lithographic methods it is not difficult to produce a 
pinhole, unresolved at optical wavelengths, and let sufficiently bright radi- 
ation through it. The fast, unambiguous and direct reconstruction achieved 
in FTH is attractive for coherent OTR imaging of electron bunches. More- 
over, FTH may also be used to generate a low-resolution image of the bunch 
to support diffractive imaging techniques. In this case, multiple references 
can be added to the FTH setup in order to increase the a-priori information 
available. 

One of the main unsolved problems in XFEL electron bunch diagnos- 
tics is the characterization of bunches that have significant distortions in 
transverse phase space, e.g. bunches whose transverse phase-space ellipse 
varies along the beam itself. In an RF photoinjector with perfectly work- 
ing emittance compensation technique, the electron beam transverse pro- 
file is axis-symmetric, and the Twiss parameters are equal in all slices (ex- 
cluding the emittance, which varies). For a real beam, the variation in the 
space charge forces can be significant and cannot be properly compensated 
with a solenoid emittance compensation scheme. In addition. Coherent Syn- 
chrotron Radiation (CSR) related effects in bunch compressors can lead to 
further deviations from the axis-symmetric model. The knowledge of the 
variation of the phase-space ellipse along the bunch at the output of the 
bunch formation system could provide significant information about the 
physical mechanisms responsible for the generation of ultrashort bunches 
in XFELs. If the 3D structure of electron bunches could be provided (even 
as the result of a multi-shot measurement), a quadrupole-scan (which is a 
multi-shot measurement method too) could be used to provide phase-space 
density distribution measurements. Twiss parameters in each slice could be 
reconstructed in this way. 

As mentioned above, the applications of coherent OTR imaging are not 
confined to diagnostics of the transverse distribution of electrons projected 
along the longitudinal axis. 

Simple extensions of our proposed diagnostic techniques allow for the char- 
acterization of the 3D structure of electron bunches with a multi-shot mea- 
surement. Our approach, described in Section 6, involves a combination 
of real and reciprocal space imaging spectrometers. Both imaging setups 
use frequency filters to obtain the spectral data of the image. When the fil- 
ter bandpass is changed, successive images are recorded at different wave- 
lengths. This process is repeated, wavelength by wavelength, until the entire 
spatial spectral data is built up slice by slice. The result is the simultane- 
ous knowledge of two "3D cubes" of spectral data, one in the real space 
(AA, Ax, Ay) and the other in the reciprocal space (AA, l^^cox, Acuy), having 
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indicated with coj^y the spatial frequencies relative to the x and y axis. Appli- 
cation of e.g. the Gerchberg-Saxton algorithm [15], which is the first practical 
iterative algorithm to have been developed for solving the Fourier-phase- 
retrieval problem, allows one to retrieve the spatio-temporal electron-bunch 
structure. 

In Section 6 we also show how the determination of the projections of the 
cube of data in reciprocal space onto specific planes of interest is sufficient 
to reconstruct the electron-bunch structure, even without knowledge about 
the cube of spectral data in real space. The advantage of this method is 
that it requires no reference pulse shorter than the electron bunch (i.e. no 
synchronization). In other words, the optical replica pulse can be measured 
in the 3D Fourier domain. We name this novel method Frequency-Resolved 
Optical Diffractive Imaging (FRODI). 

FRODI is further developed in Section 6 from a multi-shot to a single-shot 
technique to measure the 3D structure of a single electron bunch. This is 
accomplished by splitting the beam and simultaneously measuring orthog- 
onal (x, t), (y, t) and (x, y) projections. The entire traces can be recorded 
by three detectors, and used to reconstruct the desired 3D electron-bunch 
structure. 

While FROG requires the use of nonlinear-optical process, FRODI is a linear- 
optical method, and linear-optical processes do not require intense pulses. 
Here we discuss about measurements of complicated pulses in three di- 
mensions, and it is interesting to compare FRODI with the well-known 
FROG technique, which can measure complicated pulses in one dimension 
(ID). As was reported in [9] concerning measurements of temporal struc- 
tures in optical pulses: 'Tt can be shown that linear-optical methods cannot 
completely measure ultrashort pulses". Quite counter-intuitively, the mea- 
surement of 3D structures of optical pulses in time and space is simpler 
than the measurement of temporal ID structures alone. However, in anal- 
ogy with this fact, it is well-known that the two-dimensional (2D) phase 
retrieval problem is solvable, unlike the ID one. To detect the temporal 
structure of optical pulses, FROG requires the use of a nonlinear optical 
process, which allows to extend the ID reconstruction problem to a 2D 
reconstruction problem. This involves an artificial 2D Fourier domain. Un- 
like it, FRODI transforms the problem of measurement of 3D structures in 
time and space into measurements of the more natural 3D spatial-frequency 
and temporal-frequency domains. No prior information about the electron 
bunch structure is necessary to reconstruct the electron bunch density dis- 
tribution from the experimental traces. 

Our 3D imaging technique FRODI turns out to be a relatively simple solution 
to a very complicated problem. For the 3D (spatio-temporal) electric field 
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of optical replica pulses produced by optically-modulated electron bunches 
with spatio-temporal distortions, different spatial frequencies are related to 
different temporal spectra (i.e. spatial frequency and temporal frequency 
are coupled). 

Multi-shot and single-shot techniques for the characterization of the electron 
bunch can also be based on FTH setups. A new multi-shot technique for 3D 
imaging of the electron bunch based on frequency gated FTH is discussed 
in Section 7. In the same Section 7 we also consider spatio-temporal FTH 
techniques. An extension of the method opens up the possibility for single- 
shot 3D imaging of ultrashort electron bunches. 

Time-gated FTH is the next class of techniques discussed in Section 7. The 
principle of this method is straightforward. A hologram records informa- 
tion about the object only when it is illuminated simultaneously by a coher- 
ent reference wave. Then, when a short reference is used, the hologram is 
equivalent to a time-gated viewing system [16]. We propose a method based 
on time-gated FTH with multiple reference sources capable of characteriz- 
ing the spatio-temporal structure of individual electron bunches. Multiple, 
ultrashort (about 10 fs) reference pulses are generated with a varying time- 
delay, so that several two-dimensional images (frames) of the electron bunch 
at different position inside the bunch can be reconstructed from a single holo- 
graphic pattern. We call this technique Holography Optical Time Resolved 
Imaging (HOTRI). 

We conclude our work in Section 9. 



2 Optical replica setup 

2.1 Optical replica pulse generation 

We propose to create a coherent pulse of optical radiation by first modulating 
the electron bunch at a given optical wavelength and, second, by letting it 
pass through a metal foil target, thus producing coherent Optical Transition 
Radiation (OTR) at the modulation wavelength. The radiation pulse should 
be produced in such a way as to constitute an exact replica of the electron 
bunch. Such optical replica can be used for the determination of the 3D 
structure of electron bunches. Although other projects may benefit from our 
study too, throughout this paper we will mainly refer to parameters and 
design of the European XFEL. 

In order to produce the optical replica we need to modulate the electron 
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bunch at a fixed optical wavelength. One may take advantage of an Optical 
Replica Synthesizer (ORS) modulator (s*], which we suppose to be installed 
after the BC2 bunch compressor chicane. A basic scheme to generate coher- 
ent OTR is shown in Fig. 1. 

A relatively long laser pulse serves as a seed for the modulator, consisting 
of a short undulator and a dispersion section. The central area of the laser 
pulse should overlap with the electron pulse. In order to ensure simple 
synchronization, the duration of the laser pulse should be much longer 
than the electron pulse time jitter, which is estimated to be of the order of 
100 fs. Foreseen parameters of the seed laser are: wavelength Am = 800 nm, 
energy in the laser pulse 1 mj and pulse duration (FWHM) 1 ps. The laser 
beam is focused onto the electron bunch in a short (the number of periods 
is Na, = 5) modulator undulator resonant at the optical wavelength of 800 
nm. Optimal conditions of focusing are met by positioning the laser beam 
waist into the center of the modulator undulator, with a Rayleigh length 
of the laser beam equal to the undulator length. Since the electron betatron 
function (}, the undulator length Lj„ and the Rayleigh length of the laser 
beam are of the same magnitude, the size of the laser beam waist turns out 
to be about 20 times larger than the electron beam size. As a consequence, 
we can approximate the laser beam with a plane wave when discussing 
about the modulation of the electron bunch. 

The seed laser pulse interacts with the electron beam in the modulator 
undulator and produces an amplitude of the energy modulation in the 
electron bunch of about 500 keV. Subsequently, the electron bunch passes 
through the dispersion section (with momentum compaction factor R^s - 
SOjUm), where the energy modulation is converted into density modulation 
at the laser wavelength. The electron bunch density modulation reaches an 
amplitude of about 10%. 

Finally, the modulated electron bunch travels through the OTR screen. A 
powerful burst of OTR is emitted, which contains coherent and incoherent 
parts. The coherent OTR has much greater number of photons (up to lO^"' 
i.e. l/,ij per pulse, as we will see), and can be used for diagnostic purposes. 
A quantitative treatment for coherent OTR is presented in Section 3. The 
way we can take advantage of coherent OTR properties is discussed in the 
following Sections. 

It should be mentioned that OTR screens can be positioned at various lo- 
cations down the electron beam line where electrons have substantially 
different energies. In the case of the European XFEL the electron energy 
varies from 2 GeV (second bunch compression chicane) up to 17.5 GeV at 
the undulator entrance. For other machines, these parameters differ. In the 
case of LCLS lUt], energies will range from about 4.5 GeV to 13.6 GeV. 
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Fig. 1. Schematic diagram of the coherent imager. The working principle is based 
on the optical modulation of the electron bunch and on emission of coherent OTR 
radiation from the metallic mirror. 

r-i 

Reference [8] includes a discussion about how to avoid the influence of 
self-interaction effects in the ORS setup, when the radiator is placed just 
behind the modulator. This case is practically realized, for example, when 
we want to use a 3D OTR imager to align the bunch formation system, and 
the OTR station is placed just after the ORS setup behind the second bunch 
compressor at 2 GeV. The situation changes when the OTR imager is placed 
behind the main XFEL accelerator at 17.5 GeV, and the distance between ORS 
modular and OTR imager is in the kilometer scale. In Section 2.3 we will 
present ideas how to avoid self -interaction effects in the high-energy case. 
When, instead, the ORS setup and the OTR imager are placed just behind 
the bunch compression system, it sufficient to use the ideas introduced in 



2.2 Optical modulator 

In order to use the coherent OTR burst for diagnostic purposes, one has to 
ensure that an optical replica of the electron bunch is actually produced. 
In fact, the electron bunch density modulation can be perturbed by col- 
lective fields. It is therefore important to consider collective interactions 
(radiation and space-charge fields) influencing the operation of the Opti- 
cal Replica modulator, to ensure that longitudinal dynamics in the Optical 
Replica modulator is governed by single-particle effects, independently of 
the presence of other particles. In particular, our method for electron-beam 
structure measurements is based on the assumption that the electron bunch 
density modulation does not appreciably change due to longitudinal space- 
charge (LSC) interactions, i.e. plasma oscillations, as the beam propagates 
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through the setup behind the Optical Replica modulator, up to the OTR sta- 
tion. Thus, the passage of the modulated electron bunch through the setup 
must be studied. 



Even when self-interactions are negligible, distortions can be introduced 
due to nonuniform local energy spread within the electron bunch. 

Let us first consider under which conditions on the Optical Replica modu- 
lator, the energy-spread effects are negligible. The way the electron bunch 



is modulated in the modulator is quantitatively described in e.g. |17|1 . The 
current I at the exit of the dispersion section is found to be a composition of 
harmonics of the modulation frequency cOm'. 




where Pq = 500 keV is the initial energy modulation. So is the nominal 
electron energy, V<(A6)2) is the local energy spread of electrons, z is the 
longitudinal coordinate, Vz is the longitudinal velocity of electrons and t is 
the time. Moreover /„ indicates the Bessel function of the first kind of order n 
and, as before, R56 is the momentum compaction factor. Note that the current 
is, in general a function of the time, i.e. Jq = ^o(0- However, throughout this 
paper we will make use of the adiabatic approximation, because the electron 
bunch is much longer than the modulation wavelength. As a result, we can 
consider Iq as a local parameter, and discuss about local amplitude and 
phase of the density modulation. 

As one can see from Eq. (1), the microbunching depends on the choice 
of the dispersion section strength. In fact, neglecting the exponential sup- 
pression factor in ^{{AS}^), the expression for the fundamental compo- 
nent of the bunched beam current is Ji oc 2Jo/i(X) ~ IqX for X «: 1, where 
X - PoR5(,/{A„,So), with A,„ = c/com, is a small dimensionless quantity known 
as the bunching parameter. 

One might think that all we have to do is to get the microbunching amplitude 
to a maximum by increasing the Rsf, of the dispersion section, thus increasing 
the output power. In fact, it is possible to build a dispersion section with 
a large R56 function. However, one of the main problems in the modulator 
operation is preventing the spread of microbunching due to local energy 
spread in the electron bunch. In other words, for effective operation, the 
value of the suppression factor in the exponential factor in Eq. (1) should be 
close to unity ISD. 
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Fig. 2. Energy spread profile (rms, solid line) at the entrance of the SASE undulator 
(after XFEL TDR) [3]). 

The energy spread is not constant along the electron bunch. For example, 
the energy-spread distribution in the case of the European XFEL is given 
in Fig. 2, reproduced from [3]. The maximal energy spread level is about 1 
MeV. Substituting numbers into the argument of the exponential function 
(remember that the chicane of the modulator has dispersion strength - 
50 /,tm) one finds that, for the first harmonic (n = 1) the exponential factor 
is about unity (- 0.998) even for &q = 2 GeV, which is the minimal energy 
considered. Moreover, the second harmonic (n = 2) of the modulation is 
suppressed by an order of magnitude with respect to the first, due to the 
Bessel J2 factor. 

As a result, in our case we can approximate Eq. (1) with 



I = Io+Io- ^— ^ COS 

AmOo 



(2) 



which ensures that the bunching is uniform along the beam. Calling = 
oJm[z/Vz{yo) - t] the modulation phase, the current can be expressed as J = 
Jo[l + acos{\p)], where a is the amplitude of our small {\a\ « 1) density 
modulation, taken with its own sign. 

Let us now discuss distortions due to self-interaction effects. Concerning 
the induced bunching inside the modulator undulator, perturbations due 
to collective effects are minimized up to a negligible level by using a small 
number of periods {Nw = 5). This optimization is also important in order to 
increase the replica resolution and minimize slippage effects (N^yA/c « aj, 
Ot being the electron bunch duration). 

Concerning the effect of LSC interactions, one needs a more detailed anal- 
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Fig. 3. Schematic diagram of the European XFEL setup from BC2 through SASEl. 



ysis. The propagation of the iriduced electron bunch density modulation 
through the setup is a problem involving self-interactions. If the OTR sta- 
tion is placed at a short distance (say, a few meters) from the modulator, e.g. 
to align the bunch formation system, plasma oscillations play no important 
effect, as it will be clear after reading the following analysis. However, the 
situation changes if one wants to characterize the electron beam after the 
linac, to monitor the electron bunch properties along the machine and, in 
particular, to monitor electron trajectories in the undulator. In fact, as the 
bunch progresses through the linac, the modulation of the bunch density 
produces energy modulation due to the longitudinal impedance caused 
by space-charge fields. This process is complicated by the fact that, due 
to the presence of energy and density modulation, plasma oscillations can 
develop. 



As a result, the initial energy and density modulation of the electron bunch 
will be modified by the passage through the setup. In order to study the fea- 
sibility of our scheme one needs to estimate what modifications take place. 
For a given facility, OTR screens positioned at a higher electron energy 
translate into a longer distance between modulator and OTR screen and, 
thus, into a stronger LSC influence. However, the evolution of plasma oscil- 
lations tends to slow down as the energy increases. This means that LCLS 
is less affected than the European XFEL, because the last magnetic bunch 
compressor at LCLS is positioned at higher energy (4.5 GeV compared to 
the 2 GeV of the European XFEL). 
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2.3 Distortions of the optical microbunching downstream the main accelerator 



Let us consider the European XFEL case, where self-interactions are more 



important (see Fig. 3). Proceeding as in |12l] we find that energy and density 
modulation along the accelerator are linked by the following system of 
differential equations: 



da 1 A}/(z) 



dz A (yo + gzf 



d{Ay) a{z) Iq 

~ GXID 

dz A(yo + gzf Ia 



(yo + gzfA^ 



T 



0, 



(yo + gzfA^ 



(4) 



where we assumed that y(z) = yo + gz, with g the acceleration gradient 
and yo - 4 • 10"'. The density modulation a has been defined above, and 
the energy modulation is given by Ay sm{ip). Here J/i - 17 kA is the Alfven 
current, e„ is the normalized emittance, jS is the average betatron function 
and T{a, (i) is the incomplete gamma function. 

Usually, calculating self-interaction effects is rather challenging. In our case, 
however, we used the straightforward ID model in Eq. (3) and Eq. (4). 
Remarkably, such model is not a rough approximation of reality, but it 
rather constitutes a quantitative approach. In fact, in our case study we deal 
with a very particular range of parameters where four asymptotic limits 
can be simultaneously exploited. First, retardation effects can be neglected 
due to a small current I « ylA, and to the adiabatic acceleration 
A{dy^/dz) «i 1. Second, the adiabatic approximation applies^.e. wcrj » 
1. Third, a pencil beam (ID) approximation is valid becausqlJ Or ^ yA, 
with Or the bunch transverse dimension. Finally, fourth, we can neglect the 
interaction of space-charge fields with material structures because yA ^b,b 
being the characteristic dimension of the vacuum chamber. As a result, when 
dealing with an XFEL setup and discussing about optical microbunching, 
we have a unique situation. If any of the conditions above ceases to be valid, 
the model in Eq. (3) and Eq. (4) ceases to be valid too, as it would be the case 
e.g. for calculations of impedance at lower frequencies. 



^ The adiabatic acceleration limit can always be used in our case. Assuming a 
constant acceleration gradient dy/dz = g ^ 25 m~^ (see [3]), we have 2Ay{dy /dz) ~ 
10~^ for y = 4 • lO'^, corresponding to the lowest energy of 2 GeV, and 2Ay{dy / dz) ~ 
10~^ for y = 3.5 • 10^, corresponding to the highest energy of 17.5 GeV. Note that 
the largest effects due to longitudinal impedance are expected in the first part of 
the acceleration process. 

4 For 7 = 4- 10^ we have, for parameters specified here, e„jS/(y'^A^) ~ 3 • 10"^. 
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Fig. 4. Evolution of the electron bunch density modulation as a function of the 
distance from the modulator. 

Let us then use Eq. (3) and Eq. (4) assuming an average betatron function 
of about jS = 25 m along the main accelerator and a normalized emittance 
Cn = 1.4 mm-mrad (see [3]). We set the acceleration length in the main linac 
equal to da - 1220 m. The undulator does not follow immediately the main 
linac, as it can be seen from Fig. 3. For example, the SASE 1 undulator is 
preceded by a 290 meters long straight section and by a coUimation system, 
which is 215 meters long. According to [3], the compaction factor of the 
collimator can be set to any value from R^^^ = -2 mm to R^^^ = 2 mm, with 
possibility of fine tuning (not in real time) of about ±100 ftm. 

In practical situations, one is interested in inserting OTR screens just after 
the bunch compression chicane (at 2 GeV) or along the undulator at 17.5 
GeV. The situation where the influence of self -interactions is most important 
is obviously at 17.5 GeV along the undulator. As shown in [lisl]. the longi- 
tudinal Lorentz factor yz = y/ Vl + K^/2 should be used in the undulator 
instead of y. Since K = 3.3, y^ and yl differ of about an order of magnitude, 
hence a different influence of the LSC in the undulator compared to the 
straight section. In the undulator, Eq. (3) and Eq. (4) are modified to 



da 
dz 



1 Ay dAy a{z) 1q 



dz 



Kyll 



-exp 



K^yyl 



0, 



K^yyl 



(5) 



where now y = 3.5 • 10^. Even for a simple case study where we set 
numerical analysis shows that our initial conditions fl, - 0.1 and (Ay), 
yield an unwanted evolution of a{z) as summarized in Fig. 4. 



0, 
^ 1 
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Fig. 5. Installation of a small dispersive element (chicane) after the coUimation 
system will allow to compensate positional-dependent variations of the electron 
bunch density modulation level. 

As one may see, the electron bunch density modulation is diminished by 
at most Afl/fl, = 25% at highest energies and at the end of the undulator. 
This constitutes a detrimental effect concerning our imaging techniques. 
In fact, we calculated a{z) assuming a fixed peak-current Iq - 5 kA. How- 
ever, the peak-current along the bunch is not constant, and from Eq. (3) 
and Eq. (4) follows that the modulation level varies along the bunch in a 
complicated way depending on the variation of the peak-current level. This 
effect is unwanted. In fact, different parts of the electron bunch should be 
given the right weight as concerns their contribution to radiation emission, 
meaning that the (absolute) charge modulation in each point of the electron 
bunch should be proportional to the charge density distribution of the un- 
modulated bunch. This can only be realized when the bunching factor is 
uniform along the bunch i.e. when it does not depend on the charge density 
distribution nor on the energy spread distribution. 

Yet, note that the change in the modulation level is small at any value of 
z, i.e. Afl(z) ^ fl;. As a result, from Eq. (3) and Eq. (4) follows that, in first 
approximation, the current-dependent terms in both a{z) and Ay depend on 
Iq linearly. Moreover, the design value for R^^^ is negative. Therefore, one may 
think of installing a small chicane as in Fig. 5 to organize a tunable negative 
compaction factor. Calling with Rf^ the total negative dispersion strength of 
collimator and chicane, we can compensate the current-dependent term in 
a at the position of the extra-chicane by requiring: 



a — a; = — 



A 7 



(6) 
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Fig. 6. Evolution of the level of the electron bunch density modulation for different 
values of the peak-current Jq- Here we account for the presence of a dispersive 
element R^^ ^ 12jUm at the collimation system. 

Note that from the differential equation for A7(z) and a{z) we obtain 
a(z) = Ui + /(z)A7i + g{z)lQ 

^y{z) = ^yi + h{z)Io , (7) 

where /(z), ^(z) and /i(z) follow from the integration of Eq. (3) and Eq. (4), 
assuming a(z) = Ui in that equation. A proper choice of can compensate 
for the part of the electron bunch density modulation proportional to Iq, 
but cannot compensate for the term in Ay,. However, such term does not 
depend on the position inside the bunch and is not detrimental. 

With this in mind, the easiest way to find the proper Rf^ is to set a(z) = 
Ui and Ayi = in Eq. (4), solve it for Ay{z) and find a{z) from Eq. (3). 
Then, substitution of Ay(z) and a{z) in Eq. (6) allows one to obtain, for our 
parameter choice, = -12|Um. 

The final step is to find the evolution of a{z) accounting for the presence 
of Rgg. Note that here we implicitly modelled the collimator and the small 
chicane after it as a 215-meter-long straight section followed by a single, 
localized dispersive element. Since the actual setup is more complicated than 
that in our simple model, results reported here are for the sake of illustration 
only. However, detailed calculations would not present novel effects due to 
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the influence of space-charge, so that exploitation of the tunability of R^*l is 
always possible, and our simple estimations are qualitatively correct. 



Results of numerical calculations for different values of the peak-current J( 







are shown in Fig. 6. The compensation effect of can be seen from the 
spread of the modulation levels before and after the dispersive element. 
In fact, from Fig. 6 one can see that such spread is reduced from about 
Aa/tti = 30% to about Aa/ui = 2% after the compensation chicane, to in- 
crease up to Aa/Ui = 5% at the end of the undulator. One concludes that LSC 
interactions do not introduce any undesired current-dependent modifica- 
tion to the amplitude of the level of the electron bunch density modulation 
with an accuracy Aa/Ui of a few percent. 

We should stress that numbers considered in these examples are for the 
worst possible influence of LSC. As said before, at LCLS these effects would 
be even less important, as one starts from higher energies after the second 
bunch compression chicane (at 4 GeV). One can take advantage of coherent 
OTR emission also at that facility, which will allow the electron bunch image 
not to be in the shadow of parasitic coherent OTR emission |ll9t]. In fact, with 



a laser-heater |l2O0, the induced uncorrelated energy spread is expected to 
limit the parasitic modulation of the bunch after the last compressor chicane 
to less than 1%, which is much smaller compared to the 10% modulation at 
optical wavelength induced on purpose in our schem^. 



3 Characteristics of the OTR source 



3.1 Qualitative description 



3.1.1 Parameter space of the problem 

As mentioned before, in this paper we deal with two situations of practical 
interest, both common to the European XFEL and LCLS. In the first, the OTR 
screen is positioned at low electron beam energy, just after the modulator. In 
the second, it is positioned at high electron beam energy, after the main linac 
and the collimator system, possibly within the undulator line. These two 
cases, shown in Fig. 5, have differences, but the overall qualitative picture 
is similar. 



It should also be realized that in our methods we will almost always use a narrow 
bandpass filter with relative bandwidth 0.1%. In this case, the influence of parasitic 
microbunching is strongly reduced. 
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Let us define the slowly varying envelope of the field in the space-frequency 
domain as E = Eexp[—icoz/c]. We will refer to this quantity simply as 
"the field". Here E{co,r,z) is the Fourier transform of the electric field 
E{t,r,z) in the space-time domain, according to the convention: E{co,r,z) = 
^ E{t, r, z) exp{i(joi\dt. Note that r indicates the transverse position vector. 

We will be interested in the OTR emission from an electron bunch modu- 
lated by the ORS. The typical longitudinal bunch dimension is in the order 
of SOjUm (FWHM), while the modulation wavelength Am ~ O.l/im. Then, 
the adiabatic approximation applies, and coherent OTR emission is auto- 
matically characterized by a narrow bandwidth around A = A,,,. This means 
that we can use the expression for the OTR field from a single electron in 
the space-frequency domain convolved with the instantaneous charge den- 
sity distribution as a good representation of the instantaneous OTR field. 
The OTR field from a single electron in space-frequency domain is usually 
approximated a£i] 



E {i^oc K, — — - 

cy \r - rJ \ 



(8) 



where is the first order modified Bessel function of the second kind, r is 
the observation position on the screen and ^s the electron position. Eq. (8) 
is the result of the Ginzburg-Frank theory |2l|l . 

We will see that the field distribution at the OTR screen is characterized 
by two scales of interest. One is associated with the transverse size of the 
electron bunch, a,- ~ 30 /im. The other is the typical size of the single- 
particle OTR spatial distribution, which can be estimated from Eq. (8) in the 
order of yA ~ 1 mm for the 2 GeV case, where we introduced the reduced 
wavelength A = c/cu. In our case study of interest cr^ ^ y^, and Eq. (8) can 
be approximated as 



-(1) r-r 



E i^^: ^- (9) 



This fact will be exploited through all our paper. 

We should stress here the vectorial nature of the electric field, which always 
exhibits space-variant polarization. This suggests an electrostatic 2-D anal- 
ogy with the electric field generated by an uniformly charged wire. For the 



^ Further on, when the longitudinal coordinate z and the frequency co will have a 
given, fixed value, we will not always include them in the arguments of the field. 
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parameters of our problem, this analogy is valid whenever r «: yA. This 
includes, in particular the range r < «; yK. 

Information about the electron bunch will be shown to be included in a small 
region of size Oy ~ SOfim corresponding to the region of nonzero electron 
density on the OTR screen, which we will call the "bunch" region, region 
A in Fig. 7. The region of interest of the imaging system is characterized by 
r < 100 jj.m, region B in Fig. 7. 

The field distribution for 100 fim < r < 300 ^um does not depend on the 
transverse size of the electron beam. In other words, a filament beam ap- 
proximation applies. We will refer to this region as the "halo" region, region 
C in Fig. 7. Note that in the halo region, information about the peak-current 
distribution is encoded in the dependence of field amplitude versus time. 

The regions A, B and C are not influenced by the position of magnetic struc- 
tures. These become relevant at larger distances r > 300 ^m ~ yA, region D 
in Fig. 7. Note that the field in region D remains independent of emittance 
effects, i.e. the filament beam approximation is still valid. Therefore, infor- 
mation about the peak-current distribution can be extracted from region D 
exactly as from region C. 

It should be remarked that numbers given here refer to the relatively low 
electron beam energy case of 2 GeV, and should be multiplied by a corre- 
sponding factor when these considerations are extended to the high energy 
case of 17.5 GeV. 

As mentioned before, as a result of the adiabatic approximation, the field 
distribution seen on the screen actually depends on the instantaneous trans- 
verse charge density distribution. In the bunch region, the field distribution 
evolves due to the dependence of the electron bunch transverse size on the 
longitudinal coordinate, while in the halo region the field distribution is 
constant, although its amplitude depends on the peak-current. 

In this paper we will be mainly interested in the field within the bunch re- 
gion. This is important for imaging purposes. However, we will occasionally 
need to characterize the halo region too. For example, as we will see, there 
is a possibility of simplifying the Optical Replica setup for peak-current 
profile measurement by taking advantage of an OTR screen as a radiator. In 
this case, characterization of the halo region is needed. 

Outside of the bunch region an exact characterization of the field becomes 
more complicated, because the approximation of the single-electron field 
considered up to now (proportional to Ki) fails when r approaches yA. 
In fact, the influence of magnetic structures upstream of the OTR screen 
starts to be important. Fortunately, not the halo region, nor the D region for 
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Fig. 7. Scheme of the coherent OTR radiation pattern as observed in the object plane 
for the low electron beam energy case of 2 GeV. 

r > yA will be involved in the imaging process. In fact, for the halo region 
the filament-beam asymptote holds: in this region only information about 
the peak-current is encoded. 



3.1.2 Similarity techniques 

Up to now we gave a picture of our problem introducing the parameter 
space of interest. Based on the adiabatic approximation, we mentioned that 
the coherent OTR field can be written as a convolution between the charge 
density distribution and the single-electron field. In particular, we used the 
Ginzburg-Frank formula to describe the field produced by a single electron 
on the OTR screen (proportional to Ki). 

With the help of similarity techniques it is possible to present qualitative 
arguments to explain why the Ginzburg-Frank formula can be used in our 
case study, and why we can neglect the presence of bending magnet and 
straight section in our setup. Let us consider a setup composed by a bend, 
a straight section, and a target (the OTR screen). Radiation characteristics 
were shown to depend on two dimensionless parameters [23(1 : 



6 = 



(10) 
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where R is the radius of the bend, the reduced wavelength of the radiation 
A coincides, as before, with the reduced modulation wavelength A„„ and L 
is the length of the straight section. 

By definition, 1 /6 is a measure of the straight section length L in units of the 
synchrotron radiation formation length '\/R^A. The parameter (p, instead, 
is a measure of the straight section length L in units of the characteristic 
length y^A. In the case of a straight section of length L, the apparent distance 
travelled by an electron as seen by an observer is equal to L/(2y^). Since 
it does not make sense to distinguish between points within the apparent 
electron trajectory such that L/(2y^) < A, one obtains a critical length of 
interest ~ y^A. 



It is known (see e.g. 11230) that the Ginzburg-Frank theory used above is a 
limiting case of the more general Edge Radiation (ER) theor}|Zl While the 
theory of Edge Radiation accounts for the presence (but not for the detailed 
structure) of magnetic elements, the Ginzburg-Frank theory does not. In 
the theory by Ginzburg and Frank, an electron coming from an infinitely 
long straight line (L — > oo) and crossing an interface between vacuum and 
an ideal conductor is considered. As a consequence of the crossing, time- 
varying currents are induced at the boundary. These currents are responsible 
for Transition Radiation. The metallic mirror, that is treated as the source of 
Transition Radiation, is usually modelled with the help of a Physical Optics 
approach. This is a well-known high-frequency approximation technique, 
often used in the analysis of electromagnetic waves scattered from large 
metallic objects. Surface current entering as the source term in the propa- 
gation equations of the scattered field are calculated by assuming that the 
magnetic field induced on the surface of the object can be characterized 
using Geometrical Optics, i.e. assuming that the surface is locally replaced, 
at each point, by its tangent plane. One may talk of Transition Radiation in 



the sense by Ginzburg and Frank ||21|] if 6 ■ (p <^ 1, 6 <^ 1 and, additionally, 
(/)»!. 

In fact, when cp » 1 one can neglect the presence of the upstream bend 
in the setup, and consider only an electron crossing an interface between 
vacuum and an ideal conductor. Condition 6 ■ (p ^ 1 means that the crit- 
ical wavelength of Synchrotron Radiation from the bend is much shorter 
than the (optical) radiation wavelength we are interested in, i.e. A » R/y^- 
Condition 6^1 means that Synchrotron Radiation from bending magnet 
radiation is characterized by a significantly larger opening angle, compared 
to that of Edge Radiation. From the viewpoint of electromagnetic sources 
(i.e. harmonics of the current and charge density), 6 «: 1 means that a 



Which, in its turn, is a limiting case of the more general Synchrotron Radiation 
theory. 
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Fig. 8. Characteristic transverse size Oc and divergence 6c of a diffraction-limited 
source, and its relation with the formation length L which is interpreted as the 
apparent length of the source, adapted from [22]. 

sharp-edge approximation can be enforced, i.e. one can neglect the way 
electromagnetic sources begin or cease to exist. 

In our cases of interest, analysis shows that (p ^ 1 for the low energy 
case, ~ 1 for the high energy case, and in both cases 5 « 1, 6 • « 1. 
Although[i](/5 ~ 1/ when r ^ yA (i.e. within region B of Fig. 7) we can still use 
the Ginzburg-Frank theory and describe the field at the OTR screen with a 
distribution proportional to the asymptotic expression for K-[[r/{yA)]/{yA). 
In fact, on the one hand the parameter (p compares the length of the straight 
section L to y^A. On the other hand, however, if we are interested in a 
region of the OTR screen such that r « yA, the effective formation length is 
decreased to about L^(r) ~ r^/A. We can easily estimate the relation between 
source size and formation length remembering that radiation from a single 
electron is diffraction limited and using for estimation the fact that for 
diffraction limited radiation the photon emittance is given by dcOc ~ A, 
where Oc is the characteristic transverse size and dc is the characteristic 
divergence of the source (see Fig. 8). Since Oc ~ Lfdc, one has dc ~ odLf, 
which yields L /(Cc) ~ / A. For example, for region A the effective formation 
length is of order of 1 cm, while for the full region of interest of our imaging 
system (region B) we have ~ 10 cm. This is much shorter compared to 
the distance between the OTR screen and the upstream bending magnet. 
Thus, within the deep asymptotic region r « yA, i.e. within regions A and 
B of Fig. 7, the Ginzburg-Frank formula can be applied, while it begins to be 
less and less accurate in region C, up to r ~ yA, where ~ y^A and ~ 1. 
Note that ~ y^A is the maximal formation length because, transversely. 



^ In contrast, for example, with the case of Optical Diffraction Radiation imagers 
with impact parameters of order yA as discussed in |23l] . 
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the radiation intensity is cut off at r > yA. 

Further complications arise from the fact that the OTR screen can be po- 
sitioned within the undulator line. There the longitudinal velocity of elec- 
trons is effectively decreased, resulting in effectively weaker electromagnetic 
sources. The typical transverse size on the OTR screen associated with the 
electromagnetic field decreases from yK to y^A, where y\ = y^ /{1+K^ /2) and 
K = 3.3 for the SASEl undulator of the European XFEL. This means that the 
transverse size decreases of about a factor 3, and 6 is effectively increased. 
However, here we will be interested in much smaller transverse scales of or- 
der of the transverse size of the electron bunch ~ 30/jm, whereas yA - 4.4 
mm and yzA - 1.7 mm. Thus, we expect that the field distribution changes 
in the peripheral region between y^A and yA in the halo region C, but not 
in the bunch region with r < Or nor in the region B. Since, as we will see, 
these size modifications enter only logarithmically in the calculation of the 
number of photons, they may be neglected in first approximation. 



3.2 OTR from a single electron 



Consider a setup composed by a bend, a straight section, and a metallic 
mirror. In order to solve the problem of field characterization, a virtual 
source method described in [23], based on the more general work [25], 
is taken advantage of. Following that reference, and defining the electron 
charge as (-e), we obtain at the mirror positior{^ i.e. at z = : 



f ^ Iwer 
cy r 





2coer 


icoL 




c^y r 


2cy2 



X exp 



tcor^ 
'icL 



oo 



'icL 



(11) 



which is valid for any value of (p under conditions 6^1 and b-(p <^ 1. Note 
that the field is radially polarized. One may deal with two asymptotes of 
the theory for » 1 and <^1. 

As discussed before, when » 1 Eq. (11) yields back the usual Ginzburg- 
Frank theory 



^ This convention differs for technical reasons from that adopted in [23|] where the 
mirror position is at z = L/2. 
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The following result holds for the asymptote (p ^1: 



E(r) = T exp 

c 

which comes from the second term in Eq. (11). The quadratic phase factor in 
Eq. (13) describes a spherical wavefront centered on the optical axis at the 
upstream edge (the initial bend). The screen positioned at the downstream 
end of the setup detects an electric field given by the spherical wavefront in 
Eq. (13) from the upstream magnet at any value VaR^ «: L «; y^A and for 
r <^ L -^A/R. Note that when, additionally, r «: VaL, the quadratic phase 
factor in Eq. (13) can be dropped. 

In our cases of interest (p ~ 1, which is outside the region of applicability of 
the two asymptotes in Eq. (12) and Eq. (13). However, if we are interested in 
imaging an electron bunch with transverse size Or <^ yA we only deal with 
the bunch region, i.e. with the deep asymptotic region A in Fig. 7, where 
r < Or yA ~ VAL. In this region, the argument of the /i function under 
the integral sign in Eq. (11) is much smaller than unity, because r' < yA 
due to the presence of the Ki function under the same integral. As a result 
we can substitute Jiicorr' /{cL)] with corf /{2cL) inside the integral. Then, the 
integration can be performed analytically. Analysis of the result shows that 
the magnitude of the second term in Eq. (11) is much smaller (of a factor 
of order r/{yA)) than the first term in Ki. We conclude that Eq. (12) holds 
within the bunch region A in Fig. 7. 

Proceeding to larger values of r outside the bunch region A, i.e. outside the 
deep asymptotic region r <3C yA, and through regions B and C in Fig. 7, Eq. 
(12) begins to be less and less accurate. However, as we will see, corrections 
to Eq. (12) enter only logarithmically in the calculation of the number of 
photons in the halo. Therefore, as concerns photon number estimations, we 
may still use Eq. (12) in the halo region C with logarithmic accuracy. 

Finally, let us discuss OTR emission in the far-zone. Fresnel propagation can 
be performed on Eq. (12), yielding the following far-zone expression for the 
single-particle field 



2cL 



(13) 



2ey^^ 
cziy^e + 1) 



exp 



2c 



(14) 



where S, = r/z, and z is the distance between OTR screen and observation 
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plane. 



The energy radiated per unit frequency interval per unit surface can be 
calculated as in ||26l] . From the relation 



dcodS 471^ 
we have, in the far-zone: 



(15) 



4r2 



dcvdS n^cz^ {y^e + 1? ' 
One can write Eq. (16) in terms of number of photons, giving 



(16) 



ph 



a 



4r2 



dcvdS n^ojz^ {y^E} + If ' 



(17) 



with a = e^/{hc) = 1/137 the fine structure constant. Here and in Eq. (16) the 
superscript indicates that we are dealing with single particle emission. 



3.3 Coherent OTRfrom an optically modulated electron hunch 



Coherent transition radiation has been introduced a long time ago into the 
array of beam diagnostics available for measuring the microbunching in- 
duced in SASE FELs in the infrared, visible and VUV regimes [27.]-|j33i]. 
This microbunching diagnostics provides information about the longitudi- 
nal dynamics of the electron beam in the EEL. There are other applications 
of coherent OTR diagnostics. For example, recently, coherent OTR imag- 
ing techniques were used for measuring the microbunching induced by the 
space-charge instability at different locations at LCLS [19]. All these ap- 
plications are based on measuring the microbunching induced by EEL or 
space charge interactions, effects which introduce distortions in the image 
formation. We explained how to avoid these effects in Section 2. 

Having said this, we can apply our knowledge about the OTR from a single 
electron to the case of an optically modulated electron bunch. In order to 
obtain the field from the electron bunch, a microscopic approach can be 
used where the single-electron field is averaged over the six-dimensional 
phase-space distribution of electrons. In this (Lagrangian) approach parti- 
cles are labelled with a given index, and the motion of individual charges 
is tracked through space. One follows the evolution of each particle as a 
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function of energy deviation 5y, angular direction rf, position / and arrival 
time T at a given longitudinal reference-position. Knowing the evolution of 
each particle, individual contributions to the field are separately calculated 
and summed up. Due to the high-quality electron beams produced at XFELs 
(highly collimated and nearly monochromatic) we have the simplest possi- 
ble situation. Namely, when performing OTR calculations from an optically 
modulated electron bunch we can neglect both angular and energy distri- 
bution, and use a model of a cold electron bunch with given longitudinal. 



/^(t), and transverse, /;(/), charge density distributions. For a modulated 
electron bunch we write f^{t) as 



Note that in general we have no factorization of the charge density dis- 

tribution into longitudinal /^(t) and transverse /;(/) factors. However, here 
we will be interested in an estimate of the number of available photons 
and, with some accuracy, we can use a model with separable charge density 
distribution function. In fact, this assumption is not related to fundamental 
principles, and will only lead to a different numerical factor in the estimation 
of the number of photons. 

In regions C and D of Fig. 7 we have (/) ~ 1, and one should use Eq. (11) to 
characterize the field. Only where r ^yX one may use the Xi approximation 
in Eq. (12), i.e. the Ginzburg-Frank approximation. However, as mentioned 
above, we may still use Eq. (12) in regions C and D with logarithmic accuracy 
when dealing with photon number estimations. 

The field distribution for the electron bunch at the OTR screen in the space- 
frequency domain is essentially a convolution in the space domain of the 
temporal Fourier transform of the charge density distribution and the tem- 
poral Fourier transform of the single-electron field. When the charge density 
distribution of the electron bunch can be factorized as product of longitudi- 



where ft{co) is the Fourier transform of the temporal charge density distri- 
bution /t(t). 

Assuming a Gaussian transverse charge density distribution of the electron 

bunch with rms size Or, i.e. //(I) = {2no^) exp[-/^/(2cr^)] and substituting the 
single-particle OTR field, Eq. (12) into Eq. (19), we obtain 



/t(0 = /to(0[1 + afCos{co„t)] . 



(18) 



nal, /t(t), and transverse, //(/), factors, one obtains: 




(19) 



30 




' I ' I ' I ' "'I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' I ' 

25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 

r(nm) 



Fig. 9. Radial component of the OTR field in the mirror plane, Eq. (21), transverse 
(Gaussian) charge density distribution single particle field, Eq. (12), scaling as 
Ki[r/{yK)]/{yX), and its asymptote for r «c yA scaling as 1/r. Here the electron en- 
ergy is 2 GeV, the electron bunch transverse rms size is Or = 30 [im, the modulation 
wavelength is, as usual A = 800 nm. 

E{r) = — dr -KA — exp 

ncOy J ycr' \yc J 

Eq. (20) can be shown to be equivalent to 



(f - ff 



(20) 



£(r) = 



2Neef,{co)r 



cot 



■- exp 
r 



2^ 



oo 



(21) 



where Ji is the modified Bessel function of the first kind. We plot the radial 
component of Eq. (21) in Fig. 9, together with the charge transverse density 
distribution //(r), the single particle field in Eq. (12) and its asymptote pro- 
portional to 1/r for r «; 7 A. For this example, the chosen energy is 2 GeV, 
the electron bunch transverse rms size is Or = 30 jUm and the modulation 
wavelength is, as usual, A = 800 nm. As one can see from Fig. 9, outside the 
region B (i.e. within regions C and D in Fig. 7) one has no more emittance 
effects, and emission can be considered as radiation from a filament elec- 
tron bunch (with no transverse dimensions). Emittance effects are present 
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within the B region and the A (bunch) region. They encode information 
about the transverse charge density distribution. Note that the outer part of 
region B is already fitting well the filament-beam asymptote in the region 
r «; 7 A, where it is well approximated by a 1 /r behavior. Finally, for values 
of r > yA we expect modifications of the K\ behavior due to edge radiation 
contributions. 

Similarly to Eq. (16), we can calculate the spectral distribution of coherent 
photons on the OTR screen by taking squared modulus of Eq. (19). The 
field in the Fraunhofer zone is linked to that at the OTR screen position by 
a Fourier transform. Then, it can be seen that the spectral distribution of 
photons in the far-zone is equal to the product of the single-electron result 
in the far-zone, Eq. (17), and the squared-modulus of the structure factor 



dxf,{x)exp[ia)x] J rfZ/KOexp [^.f • /] , 
which can be expressed as 



(22) 



(23) 



— — 

where fz{co) and fi{E.) are the Fourier transforms of the temporal and trans- 
verse charge density distributions (and can be identified with the two factors 
in Eq. (22)). Thus, one has 



dcodS ^ dcodS 



(24) 



In case of a Gaussian transverse charge density distribution we have: 



/7(<^) = exp 



2c2 



(25) 



A Gaussian temporal electron bunch profile with an rms bunch duration Oj 
and an amplitude of density modulation Uf is also assumed. We define the 
function /,o(0 = l/(27i4)-^/2 . exp[-f/{2al)] and, with the help of Eq. (18) 
we obtain 



Uf ( 

/tM = yiexp 



-—{fx)-(x)m) 



-I- exp 







1 -1- exp 


2 



(26) 



where cOm = 2n/Am. 
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3.3.1 Adiabatic approximation 

We now apply an adiabatic approximation, which relies on the fact that the 
bunch duration Ot is much longer than the period of the modulation. In 
other words, Otco^ » 1. This allows us to simplify Eq. (26) as 



M(^) = Y exp 



(27) 



which is valid for frequencies co around the modulation frequency cOm- From 
Eq. (27) follows that radiation is exponentially suppressed for frequencies 
outside the bandwidth l/oj centered at the modulation frequency co^. 



3.3.2 Angular distribution of coherent OTR photons emitted by an optically 
modulated Gaussian electron bunch 

Since we are interested in coherent emission around the modulation wave- 
length, we can consider the wavelength in Eq. (17) and Eq. (25) fixed. Then, 
according to Eq. (24), the calculation of the angular distribution of photons 
emitted around cOm amounts to a multiplication of Eq. (17) by \fi\'^ and by 



oo 

j dco \Mco)f = yfRaj/i^ar) , (28) 



leading to the following expression for the number of coherent OTR photons 
emitted by an electron bunch per unit solid angle: 

where dQ, - ^d^dcp, (p being the aztmuthal angle. 



(29) 



3.3.3 Total number of coherent OTR photons emitted by an optically modulated 
Gaussian electron bunch 

To estimate the total number of coherent OTR photons it is sufficient to 
integrate Eq. (29) over dQ, giving 

Nph = [-1 + exp(N)(l + N)r(0, N)] , (30) 
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where, as before, T{a, j8) is the incomplete gamma function, and we defined 
N = oil (A^y"^). The parameter N is analogous to a Fresnel number in diffrac- 
tion theory, and is the only (dimensionless) transverse parameter related to 
radiation emission in our Gaussian model. 

In the high energy case N ~ 10"^, whereas in the low energy case N ~ 10"'^. 
Considering Uf - 0.1, - 6 ■ 10^, i.e. about 1 nC of charge, - 80 fs we 
obtain, for A = 800 nm, a total number of about 10^^ coherent photons per 
OTR pulse into a 10-nm bandwitHB 

In the limit for small values of N one obtains the following asymptotic 
expression for Eq. (30): 

Nph - [-1 -Te- ln(N)] , (31) 

Te - 0.58 being the Euler constant. The asymptote in Eq. (31) canbe exploited 
when dealing with XFEL setups, because of the extreme high-quality of the 
electron bunch (small emittance). As stated before, the electron beam size 
and the transverse dimension y A enter only logarithmically in the expression 
for the number of photons. Therefore, the number of photons available is 
almost insensitive to the value of N. 

Finally, it is interesting to estimate the number of coherent OTR photons in 
the bunch region A, Fig. 7. This can be done substituting Eq. (20) into Eq. 
(15) and integrating over co with the help of Eq. (28). This yields the energy 
radiated per unit surface on the OTR screen. Then, integrating over dS and 
dividing by hco yields about 10^^ photons in the bunch region. 



3.3.4 Angular distribution of coherent OTR photons in the case of arbitrary peak- 
current profile 

Within the adiabatic approximation, i.e. for OjcOm ^ 1/ it makes sense to 
introduce an expression for the instantaneous power as a function of the 
peak-current 7o(0 ^rid modulation a{t). 

To this purpose we consider, first, a stepped-profile model for the bunch 
/to(0 = 1/T for -T/2 < t < T/2 and zero elsewhere. We also suppose 
a{t) = af constant to start with. It follows that 



Note, for comparison, that the estimated number of incoherent photons per OTR 
pulse is about 10^ into a 100-nm bandwidth. 



34 



- [sin[r(a; + a;m)/2] sm [r(a; - a;,„)/2] 1 
f^{a))=afl — — + — \ . 

[ T{a) + a)m) T{a)-a)n,) J 

(32) 

Then, in the limit for T » Am /c one has rfo; |/^(a;)| ^a^ttT/Z, and 



dP 
dS 



IdW _ g^/Jp^ 
T dS ~ 8n^e^ 



exp 



^(1) 

E 



(33) 



where dP/dS power per unit surface. If now the peak-current Iq = Io{t) and 
the modulation level a = a{t) are slowly varying functions of time on the 
scale Am/c, we can interpret Eq. (33) as the instantaneous power density at 
time t, and with the help of Eq. (14) we obtain the following expression for 
photon flux radiated into the unit solid angle: 



dNph aa{t)%{tf y^e 



dn 



exp 



(34) 



Eq. (34) can be used in order to study the general case of an electron bunch 
with arbitrary gradient profile and amplitude of modulation. 



3.3.5 Ejfect of angular filtering 

To conclude this Section, it is interesting to consider the effect of angular 
filtering [23]. For mathematical simplicity, let us introduce the angular fil- 
ter profile through an amplitude transmittance T(^). We choose a simple 
Gaussian model for T(^), and we write: 



m) = exp 



20 



(35) 



Then, the effect of angular filtering is accounted for by multiplying Eq. (29) 
by T^{^), i.e. the squared of the amplitude transmittance, and by integrating 

-i 

over angles dE,. One obtains 



N^a^a 

Nph = — [-1 + exp(N + A)(l +N + A)r(0, N + A)] , (36) 

4 yIncOmOT 

where A = ll{ydaf. Eq. (36) is analogous to Eq. (30), where A now plays the 
same role of the N parameter and accounts for angular filtering effects. 
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4 OTR imager 



Having characterized the radiation from an OTR screen for both a single 
electron and an optically modulated electron bunch, we will now consider 
the physics of the image formation process. In particular we will discuss the 
OTR imager setup up to the image plane where the detector is placed. 

A simple setup for OTR imaging is schematically shown in Fig. 10. Radiation 
is reflected by an annular mirror (which allows the passage of the electron 
bunch) and an image is formed in the image plane with the help of a 
converging lens. In this case, the object plane is the OTR screen. 

The annular-mirror design depicted in Fig. 10 is usually applied in the 
measurement of transition radiation around the THz frequency range for 
longitudinal profile characterization. However, for optical frequencies (A ~ 
0.1 ftm) an important fraction of OTR will be lost through the center hole. 
A possible solution to this problem is to avoid the use of an annular mirror 
as shown in Fig. 11, where a near-normal-incidence scheme is shown. Note 
that in order to compensate for the small tilt-angle of the object plane one 
needs to tilt the image plane as well. The arrangement in Fig. 11 is currently 
used for incoherent OTR imaging of electron beams at LCLS (see e.g. [3^). 

One should therefore consider the mirror design in Fig. 11, rather than that 
in Fig. 10. However, for the sake of simplicity of drawing, we will still refer 
to the non-tilted OTR screen design in Fig. 10. This does not make any 
difference concerning the description of our methods. 

Another optical system traditionally used for imaging purposes is the well- 
known two-lens image-formation scheme in Fig. 12. This scheme allows 
for magnification by changing the focal distance of the second lens but for 
simplicity, in the following we will assume that the two focal distances 
are the same (i.e. we consider 1 : 1 imaging). This two-lens setup is usually 
employed for image-processing purposes, as it can be better used for image- 
modification compared to the single-lens system. Therefore, in the following 
we will systematically consider the two-lens scheme in Fig. 12 instead of the 
single-lens scheme described in Fig. 11. 



4.1 Theoretical basis for the analysis of coherent imaging systems 

Given the two-lens setup discussed above, we first consider the relatively 
easy problem of characterization of single-particle radiation in the image 
plane. 
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Fig. 10. Optical setup for incoherent OTR imager. For high electron energies, the 
XFEL case, an important fraction of the coherent OTR radiation will be lost through 
the center hole of the mirror. 




Fig. 11. Optical setup for the near-normal-incident OTR screen. The detector image 
plane is slightly tilted to compensate for the tilt of the object plane. 

We introduce the lens transmission function 



icor 



,/2 



Icf 



(37) 



where / is the focal distance of the lens and P is the pupil function of the lens, 
which may account for finite extent of the lens, apodization, aberrations and 
is, in general, a complex function of f. 



We assume that the two lenses in Fig. 12 are identical. Let us explicitly 
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output (image) 
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Fig. 12. A two-lens image-processing system. In order to perform Fourier processing 
of the input signal a transparency and a polarization transformer can be inserted 
in the Fourier plane to suitably modify the Fourier transform of the object. 

derive the field in the output (image) plane of the setup in Fig. 12. In the 
following, it should be clear that we discuss about coherent imaging of 
an extended object. In fact, as we have seen in the previous Section 3, the 
distribution of the OTR field from a single electron is not point-like, but 
rather laser-like and in the XFEL case its extension is macroscopic, in the 
millimeter scale. Standard description of image formation by a lens is based 
on the assumption that optical systems are space-invariant, or isoplanatic 



|35|1 . Once the space-invariant condition has been assumed a linear-system 
treatment can be applied, which consistently considers a lens as linear filter. 
We recall that a linear filter is characterized by convolution equation of the 
form f = h*g, where g is the input function at the object plane, h the impulse 
response and / the output function in the image plane. 

Imaging systems that use coherent light are linear in field amplitude, but are 



space invariant only under certain conditions |36l,l37l,l39l| This means that 
the concept of impulse response or transfer function for coherent imaging 
has only a limited use. In particular, as we will see, a convolution equation 
of the form f = h* g can be applied to lenses only after certain quadratic 
phase factors can be neglected within the field-propagation equation. Then, 



As stressed before, here we discuss about a specific subject, namely coherent 
imaging of extended objects. As remarked in |39fl, "in general, even aberration- 
free thin lenses do not meet isoplanatic condition. Some spatial phase distortion is 
unavoidably introduced, which severely modifies the intensity distribution of the 
image". Up to now, this problem attracted little attention of the Optics community, 
and the practical importance of this subject was recognized only very recently. 
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the (scaled) pupil function plays the role of the amplitude transfer function 
for the system. 

With reference to Fig. 12 we call with z the coordinate on the optical axis, 
z = being the position of the object plane. Thus, the position of the Fourier 
plane is at z = If, and that of the image plane is at z = 4/. 

Within a paraxial treatment in the space-frequency domain, each polariza- 
tion component of the field propagates in free-space from position Zg to 
position z according to 

lnc\z -Zs\J 

For each polarization component, we also introduce the spatial Fourier 
transform of a field distribution 



F{z,u) = j dr'E{z,f^)exp[if -u]. (39) 

The propagation equation for the spatial Fourier harmonics of the field in 
free-space is given by 



tco- 



r - f 



lc\z 



(38) 



F(z,u) = F (0, it) exp 



ICZU 

'~2a7 



(40) 



Note that F (z, it) is the 2D spatial Fourier transform, with respect to trans- 
verse coordinates, of the "field" £(z, r'), which is, in its turn, the temporal 
Fourier transform of the electric field in the space-time domain. As a re- 
sult, F(z,u) is actually a 3D Fourier transform of the electric field in the 
space-time domain, with respect to time and transverse coordinates. It is, 
therefore, a function of the longitudinal coordinate z. 

The relation between the field distribution on the pupil, z = /, and the field 
distribution in the focal plane, z = 2f, is given by 



E(2/,r-}): 



ICO 



Incf 



exp 




dr E{f,r')P{r')exp 



ico{rf • r') 



(41) 



Here E{f, f) indicates the field amplitude immediately in front of the lens. 
Also, Tf indicates the transverse position in the focal plane, i.e. at z = If. 
Use of the convolution theorem and Eq. (40) allow one to write Eq. (41) as 
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E{2f,rf) 



ICO 

Incf 



exp 



i(jo\rf\^ 
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exp 



icfu^ 



Fid -it) , 
(42) 



where we defined 



V{u) = J dfP{r) exp [-if -u] . (43) 

Simple analysis, consisting of a change of variable (u — > -it + corf/{cf)) 
followed by expansion of the quadratic phase factor in Eq. (42) shows that 
the field in the focal plane, Eq. (42), can also be written as 



corf' 



exp 



icu^f 



ICi) 



P{it)exp[iu- rf] . 



(44) 



Assume now that the pupil simply consists of a finite aperture. In other 
words. 



P(r ) = 1 for r < a, P{r) = otherwise . (45) 

We indicate with o the maximal transverse size of objeclH^, which is a 
characteristic transverse scale of the problem. 

We also assume: 



o^a, a^/{Af) » 1 . (46) 

Consider Eq. (44). On the one hand, the smallest structures in the Fourier 
transform of the field in the object plane under the integral sign are of the 
order of w ~ 1 /o, meaning that we can neglect variations of F for frequencies 
w « 1/cr. On the other hand, P enters the integral sign as well, and has a 
width of 1 /a. It follows that, for o ^ a, one can neglect variations of F in u, 
and can simply take F out of the integral sign. Moreover, u <l/a due to the 
width of the pupil. As a result, whenever a^/ (Af) » 1 the quadratic phase 
factor within the integral sign can be neglected and one obtains 



e(2/.*7) 



Not to be confused with the transverse size of the bunch a,-. Here and in the 
following, o is the characteristic size of any object in the object plane. 
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P(r') =1 at r'< a 
P(r')=0 at r'> a 
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Fig. 13. Left: a coherent object with characteristic size a is spatially Fourier trans- 
formed by a lens with focal length / and an aperture of radius a. The pupil aperture 
is placed directly against the lens. When the pupil aperture is much larger than 
a single Fresnel zone, and the object size is a small fraction of the pupil size, the 
diffraction-limited optical system is space-invariant, i.e. isoplanatic. The concept of 
amplitude transfer function can be directly applied to the system, and the relation 
between pupil and amplitude transfer function is straightforward: the scaled pupil 
function plays the role of the amplitude transfer function. Right: the pupil aperture 
is placed behind the lens in the focal plane. In this case, the pupil plays the role of 
a transparency, which sharply limits the range of Fourier components passed by 
the system. Under the accepted conditions (46), left and right configurations are 
equivalent. 

As long as conditions (46) hold, it does not matter if we consider the aperture 
P placed at the lens position, or at any other position between the lens and 
the Fourier plane. This can be understood by inspecting Eq. (47). In fact, Eq. 
(47) is the same as for the case when a finite aperture is placed in the focal 
plane of a lens with non-limiting aperture. The situation is summed up in 
Fig. 13. 

The field in the image plane z = 4/ is obtained by taking Eq. (47) as a new 
object, and propagating the field through the second lens in Fig. 12. One has 



cfu cfv 



E(4:f,ri)^- J dv J duE(^0/-^ + — -fi\P{u) 



X exp 



icv^f 



P{v} exp[iv • ri\ . 



(48) 



Here indicates the transverse position in the image plane, i.e. at z = 4/. 
When conditions (46) hold, one can show that Eq. (48) can be written as 
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E (4:f, ri) = - j dwE |o, ^ - j^j - J duP{u)P{w - u) . (49) 

In fact, u,v <l/a, and the quadratic phase factor in Eq. (48) can be dropped 
due to a^/{Af) » 1. Moreover, one always has \Af{u + v) - < o. Therefore 
ri < o + 2Af/a, and also the linear phase factor v- Ti can be dropped when 
conditions (46) hold. A further change of integration variable yields Eq. (49). 

Now, the autocorrelation integral in du in Eq. (49) is equal to the Fourier 
transform of |P(m)P done with respect to the transform variable -w. How- 
ever, for our particular choice of P, = P and the integral in dit simply 
equals P{-w). As a result 

£(4/,r-:) = - J du-^p(^-trj-£(o,-^j , (50) 

where we changed again w to it for notational reasons. 

The previous analysis of the field in the image plane, shows that the concept 
of coherent transfer function remains meaningful for diffraction-limited op- 
tical system when the pupil size is much larger than a single Fresnel zone, 
and the physical size of the object is a small fraction of the pupil size (see 
conditions (46)). 

Note that both conditions (46) are satisfied in our case of single-electron 
imaging, because we assume that the numerical aperture is NA ~ 0.1, 
A ~ 1 [im. and / ~ 10 cm. This automatically means that a is in the centimeter 
scale, which is much larger than the scale of the coherent OTR pattern, hence 
o <^a. Also, one can see that a^/ (A/) » 1. 

4.2 Image of a single electron 

Let us now consider the problem of OTR pattern imaging from a single 
electron. First, note that Eq. (50) is valid for each polarization component. 
Remembering Eq. (12) and its vectorial character one has 

where Tq indicates the coordinate in the object plane, and the superscript (1) 
indicates the single-particle field. By virtue of the convolution theorem, Eq. 
(51) may also be written as 
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Thus, for our choice of pupil, and when conditions (46) hold, it does not 
matter if we consider the pupil aperture P placed at the lens position, or in 
the Fourier plane. 

The energy per unit frequency interval per unit surface in the image plane 
is given by 



ico{ri - ro) • r/ 



• (52) 
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or equivalently by 
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(53) 
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(54) 



Eq. (53) (or Eq. (54)) is the response to the single-electron source in the 
case when the Ginzburg-Frank formula is valid, when conditions (46) hold 
and, additionally, under the assumption of an ideal lens with a finite pupil 
aperture (P(r) = 1 for r < a and zero otherwise). 

In general, the single-particle field in the image plane, Eq. (51) (or equiv- 
alently Eq. (52)), is non azimuthal symmetric. Note that if P is azimuthal- 
symmetric in the cylindrical coordinate system (r, z), where z is the optical 
axis (i.e. P(f) = P(|r1)), Eq. (54) reduces to 



fir, 



(55) 



where we set ro = for illustration. For an arbitrary offset ro, a generalization 
of Eq. (55) can be obtained substituting r, with |rf - ro|. 

Eq. (53) (or equivalently Eq. (54)) includes information about the optics, 
through P{rf), and about the way the electron input is converted into elec- 
tromagnetic field, through the single-particle field. Because of this, we will 
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refer to I^^\ri), that is the image of OTR produced by a single electron (i.e. the 
impulse response for OTR), as the particle spread function of the syste 
The particle spread function differs from the standard diffraction pattern 
from a point source, which is known in optics as the point spread function of 
the system and is defined as the squared modulus of the Fourier transform 

of the pupil function. Similarly, E {ri) may be referred to as the amplitude 
particle spread function of the system. 

It should be remarked that the expressions for I^^^(r,) given above assume 
that no polarization component is selected. However, they can be easily 
modified to deal with such case. 

It is also interesting to note that for OTR, the direction of the electric field 
depends on the transverse position (because the field is radially polarized). 
The fact that a Bessel /i function enters in Eq. (55) (and not a Bessel /o function 
as for point-source imaging) is actually due to lack of azimuthal symmetry 
of the OTR field. This fact is obviously responsible for a wider particle 
spread function. The lack of azimuthal symmetry can be better appreciated 
considering Eq. (51) (or Eq. (52)) rather than Eq. (53) (or Eq. (54)). Since we 
are concerned with coherent imaging, the former two equations are the ones 
we will be actually dealing with. 



4.3 Image of an electron bunch. Incoherent and coherent case 



Well defined algorithms exist for calculating the image for a complicated 
input signal (i.e. an electron bunch with given transverse charge density 
distribution) in the case of incoherent or coherent radiation. In both cases, 
resolution of the image is strictly related to the particle spread function (Eq. 
(53)) and the amplitude particle spread function (Eq. (51)) discussed above. 

Let us introduce the electron density distribution of the electron bunch, 
p{t, r). Here we are interested, in particular, in the electron density distribu- 
tion of a bunch crossing the bondary between vacuum and OTR screen. Let 
us also call p{a), r) the Fourier transform of p{t, r) with respect to time. 

In the coherent case, the intensity distribution in the image plane is given 
by the squared modulus of the convolution of the amplitude particle spread 
function, Eq. (51), with the electron beam transverse profile at frequency co: 



Sometimes the particle spread function is called single particle function |40I1 . 
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In the incoherent case, the intensity distribution in the image plane is given 
by an ensemble average of independent contributions of the form of the 
particle spread function, Eq. (53) over the electron density distribution. 
These contributions can be ascribed to each electron, and are independent 
of one another. The ensemble average procedure amounts to a convolution 
of the particle spread function with p(0, r) - dt p{t, r) Therefore, one 
obtains: 



(57) 



Note that both Eq. (56) and Eq. (57) reduce to Eq. (53) in the case of a single 
electron, i.e. for p{co,r) - 6{r- fo) exp[icoto], where fo and to indicate the 
electron offset and arrival time at a given longitudinal reference position. 

In practical cases of interest we will also introduce a Fourier-plane mask 
with amplitude transmittance Tm{ff). Thus, Eq. (47) is modified to 



^(^^''f) = ^A^'-^r^^^-^'f^ ■ ^^^^ 

Defining 7~m{u) = f dr'Tm{r') exp[-zr' • it], one should therefore perform the 
substitution 

P{u)^ f du"Piu - u')T„iu') = \ du'Tmiu') ^™ h{a\u-u'\) (59) 
J J \u-u'\ 

in Eq. (57) and Eq. (56). 

In both cases one needs to calculate the integral 



which may be interpreted as a modified expression for the field accounting 
ad hoc for the presence of the spatial-frequency filter associated with the 
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jfinite aperture of the lenses. Eq. (56) and Eq. (57) remain formally identical, 

but we now substitute the single particle field with £,nod- Therefore, Eq. (57) 
can also be written as: 



I{co, ?i) oc J dr"p(0, r - I J du' Tm - i/'j • £ 
Similarly, Eq. (56) can be presented as 



^(1) 

mod 



cfu' 



CO 

\ J 



(61) 



l{<jj, Ti) OC I J" dr'p{co, f -Yi) ^ du' Tm - «' j • £ 



-d) (cfu 

mod — 



(62) 



4.4 OTR particle spread function 



We now want to calculate the particle spread function of the system, which 
is essentially the radiation pattern produced by a single electron in the 
image plane. This is obtained from Eq. (61) or Eq. (62) by substituting the 
transverse electron density distribution with a Dirac 6-function. Discussing, 
for simplicity, the case ro = one has 




Let us consider the simplest case when the Fourier-plane mask is absent, i.e. 
when we account for a pupil which sharply limits the range of the Fourier 
components passing through the system. When one deals with imaging 
properties, one is only interested in relative distributions of the radiation 
intensity. For the present discussion we are only interested in the relative 
distribution of the radiation intensity in the image plane. Rewriting Eq. (63), 
imaging of a single electron by a diffraction-limited, two-lens optical system 
gives 



l^^\co,ri)^Al{co,ri) = 



/ 



de 



02 



(y202 + 1) 



coOvi 



(64) 



Here da = a/f. The resolution of the imaging system is related to the width 
of Ap. The function Ap{co, ri) is, instead, to be considered as a mathematical 
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Fig. 14. The particle spread function of the system (soHd line), Eq. (64), for A = 800 
nm and 9a = 0.1. The point spread function, for which = /(^]\{ri6al{/i)\l{ri6a)^, 
is shown for comparison (dashed line). 

construct, which does not coincide with the field amplitude, being in fact 
the square root of the sum of the intensities along orthogonal polarization 
directions. 



An example of the particle spread function for A = 800 nm and da = 0.1 
is plotted in Fig. 14. In the region of interest for electron bunch imaging, i.e. 
Ti « yA, the result is practically independent on the choice of y for XFEL 
setups. In this region, one may approximate y^d^ » 1. As a result one may 
approximate the right hand side of Eq. (64) obtaining |41l] 



ootid ale 



(65) 



where, as before, we set Tq = 0. 

The OTR particle spread function is quite different compared with the point 
spread function = A^}l[rida/{A)]/{rida)^ for a circular pupil. In particular, 
as one can see from Fig. 14, in the case of the OTR particle spread function 
the resolution is worse. 



Finally, note that the function Ap presented here does not account for the 
peculiar field polarization properties of OTR. The same reasoning done 
to derive Ap starting from Eq. (63) can be done separately considering the 
response to a single electron in the x or in the y polarization direction. In this 
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case, one obtains the following amplitude particle spread functions for the 
orthogonal polarization components of the field in a Cartesian coordinate 
system: 



M cos(d)) 
A^;\cv, r, (P) = -^Apicv, r) (66) 



and 



i,A sm((h) 
A^f{cv,r,(P) = ^i^Ap{cv,r). (67) 



4.5 Method for improving the OTR particle spread function 



One will have the best resolution of the OTR imaging system when the width 
of the particle spread function A^ is minimized, the optimum being the 
point spread function, the dashed line in Fig. 15. Such optima03 situation 
is sketched in Fig. 15. Once the wavelength is fixed, the resolution only 
depends on the numerical aperture of the system. Note that, with reference 
to the system shown in Fig. 15, the field distribution in the focal plane is 
uniform, and the polarization is spatially invariant. As a result, the field in 
the image plane is azimuthal symmetric. It is the response of the system to 
a point source. 

In the OTR imaging case, the situation is different, as one can see from Fig. 
16. The very specific OTR source is not point-like but, rather, laser-like with 
a polarization singularity. As a result, the field distribution in the focal plane 
is not uniform, and the direction of the electric field varies as a function of the 
transverse position. Therefore, in our case we have two separate amplitude 
particle spread functions for two orthogonal polarization directions. They 
are not azimuthal symmetric, because they depend, respectively, on x/r and 
y/r, i.e. on cosine and sine of the azimuthal angle. Therefore, in the image 
plane one obtains a non-azimuthal symmetric response, which worsens the 
resolution. Only if one sums up the intensity patterns referring to the two 
polarization components for a single electron, one obtains an azimuthal 
symmetric distribution, since (x^ -I- y^)/r^ = 1. 



Actually, in the coherent case the resolution can still be improved by introducing 
pupil apodization, see e.g. [42]. Here the words "optimal situation" and "best 
resolution" are to be understood in a narrow sense. Namely, as we will see, one 
can avoid blurring of the OTR image due to the particular particle spread function 
of OTR and reduce the problem to the standard case when point-like sources are 
considered. 
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Fig. 15. Two-lens imaging system with a point source. 
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Fig. 16. The response of the system in the case of Optical Transition Radiation 
emitted by a single electron. 

Methods to improve the particle spread function for OTR imager have been 
studied previously, e.g. in | j4ol ]. The particle spread function is improved 
by selecting one polarization component of the field and by introducing a 
Fourier filter with the help of a "round, opaque mask placed in the back focal 
plane of the lens to prevent the passage of photons at angles smaller than 
the mask angular acceptance" [40. ]. Selection of one polarization component 
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Polarization transforming and Fourier filtering to improve system performance 
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Fig. 17. OTR particle spread function synthesis by a Fourier-plane filter and a 
polarization transformer. 
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Fig. 18. A practical arrangement of the coherent OTR image-processing system. 

of the field can improve the resolution in one projected direction {x or y) 
only, but not in both simultaneously. Moreover, the other projection yields 
a much worse resolution. 

Here we propose a method to improve the particle spread function up to the 
best possible point spread function. Our method is based on the introduction 
of a filter transparency (mask) with a particular amplitude transmittance 
and a polarization transformer in the Fourier plane. 
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polarization 
transformer 




Fig. 19. Schematic plot illustrating the transformation of a radially polarized beam 
into a linearly polarized beam. Instead of a continuous spatially varying retarder, 
a sectioned device can be used, where each section has different orientation of the 
optical axis of the half -wave plate. 

A sketch of the scheme is given in Fig. 17. Use of a Fourier-plane mask and a 
polarization transformer allows one to get back to the case shown in Fig. 15, 
where the field in the focal plane is uniform, and the polarization is spatially 
invariant. As a result, one improves the OTR particle spread function to 
obtain the usual point spread function. Our scheme can be implemented in 
an OTR imaging system as shown in Fig. 18. Down here we will consider in 
more detail the effect of both polarization transformer and amplitude mask. 



4.5.1 Polarization transformer 

The function of the polarization transformer is to change the radially po- 
larized OTR into linearly polarized radiation as shown in Fig. 19. Here 
we will consider, as an example of technical realization of a polarization 



transformer, a device recently demonstrated in [|43l,l44l] 



When a radially polarized beam passes through the transformer, each sector 
rotates the polarization vector by different angle. As a result, the polarization 
distribution just after the polarization transformer is nearly-linear. 

We can account for the polarization transformer in our expression for the 
field by replacing Eq. (52) with 



where we set ro = as before. 



exp 



icoirj-rfY 



(68) 
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4.5.2 Fourier-plane mask 



Consider now the amplitude particle spread function, Eq. (68). Suppose 
that we insert a transparency in the Fourier plane, whose amplitude trans- 
mittance essentially yields an extra factor r. In other words, we substitute 
P{rf) in Eq. (68) with the product P{r)T,„{r), with P(r) = cirC(,(r) defining the 
a limiting aperture for the system. With the help of the cylindrical coordi- 
nate system (r, (p) we can integrate Eq. (68) over azimuthal coordinates, thus 
obtaining 



a 

±(1) 



ciy^r^f + P) 

u ' 
Let us now consider a mask characterized by 




(69) 



T,n{r) = To • r/a , (70) 

where To is a common amplitude-attenuation coefficient (equal to unity or 
smaller). In this case, assuming To = 1 for simplicity, Eq. (69) is replaced by 



2ey^ \ IcoriTf^ 



E ,4/,.) = ^/..;.J{^^|;.^^|. (71) 

^ 

As noted before, in the region of interest for electron bunch imaging, i.e. 
r, « yA, we can neglect the term in in the denominator of the integrand 
in Eq. (71), which gives 





Finall>liE 



(72) 



Note that propagating the field in Eq. (12) from the object plane (z = 0) to the 
Fourier plane (z = 2/), one obtains a phase shift n. This is the phase shift between 
"impulse", the OTR field on the screen and "response", the OTR field in the image 
plane (the geometrical phase shift exp[-zcuz/c] having already been accounted for 

in the relation between E and E). Such phase shift represents the analogous of the 
Gouy phase shift in laser physics. For azimuthal-symmetric beams, the Gouy phase 
shift is known to be n/2. However, this result is not valid for Eq. (12), where the 
cartesian components of the field depend on the azimuthal angle. After propagating 
to z = 4/, the result is the usual Gouy phase shift n/2. 
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Fig. 20. The radial profile of the OTR field from a Gaussian electron bunch in the 
Fourier plane, behind polarization transformer and linear mask. 



Rewriting Eq. (73) yields the well-known expression for the amplitude point 
spread function: 



which is the diffraction pattern of a circular aperture, where da, as usual, is 
the angular dimension of the pupil aperture. 

Note that Eq. (74) is only valid for some range of r,. In fact, the validity of Eq. 
(74) for all values of would mean uniform field distribution amplitude in 
the Fourier plane, while on the optical axis we always have zero amplitude. 
With our mask profile, Eq. (70), the validity region of Eq. (74) is for < 
r, <c yA. This corresponds to an improved pattern of the field profile in the 
Fourier plane up to frequencies of order l/(yA). 

In principle, one may use a more complicated mask profile to improve 
the pattern in the Fourier plane up to frequencies even much smaller than 
l/(yA). Yet, we can demonstrate that for the XFEL case our simple choice of 
linear transmittance provides a sufficient accuracy. This fact is shown in Fig. 
20 and Fig. 21 for the 2 GeV case, with a - 3 cm and / = 30 cm. Fig. 20 shows 
the field profile generated by an electron bunch with transverse dimension 




(73) 




(74) 
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Field profile in the image plane 

Transverse charge density distribution (Gaussian) 




r. (^im) 

Fig. 21. The radial field profile in the image plane (solid line) compared with the 
Gaussian electron charge density distribution (dotted line). In the inset we show 
the same plots on a different scale. 

Or = 30 [im in the focal plane of the two-lens imaging system. Note that 
the field has zero amplitude on the optical axis. This fact corresponds to 
deviations of the tails of the field distribution in the image plane relative 
to the Gaussian shape, as shown in the inset Fig. 21. However, as one can 
see, the field profile in the image plane shown in Fig. 21 is reproducing 
the bunch profile quite accurately. A Gaussian fit of the field profile with a 
Gaussian function yields crf^'^ = 29.86 |Um. The relative difference between 
Or and crf^'^ is thus about 0.5%. 

4.6 Resolution analysis for incoherent and coherent imaging 

No conventional resolution criterion can be applied for the particle spread 
function in Eq. (63). However, with the help of our method we improved 
the particle spread function to obtain the ultimate resolution, so that the 
amplitude particle spread function is now equal to the amplitude point 
spread function, Eq. (74). 

Instead of dealing with the amplitude particle spread functions in Eq. (66) 
and Eq. (67), we deal with the solution of a simple diffraction problem, 
where the field produced by a point source is diffracted by a circular pupil. 
Therefore, the resolution analysis is reduced now to the theoretical f rame- 
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work of standard incoherent or coherent imaging theory (see e.g. ||450). One 
can take advantage of well-known resolution criteria like the Rayleigh or 
Sparrow criteria, or alternatively, of a formulation of the resolution problem 
in terms of Optical Transfer Function (OTF), incoherent case or Coherent 
Transfer Function (CTF), coherent case. 

In principle, our technique could be applied both for incoherent and co- 
herent radiation. However, in the incoherent case, while spatial resolution 
improves towards the point source, limitations arise due to the small num- 
ber of photons available. As a result, the method described above is better 
applied for the coherent imaging case. 

When we deal with coherent OTR imaging, limitations of techniques to 
improve the particle spread function due to a small number of photons can 
be avoided, given the much larger available photon flux. 

As seen before, processing the spatial spectrum and polarization in the 
Fourier plane allows us to reduce a vectorial problem to a scalar one, so 
that, instead of dealing with the amplitude particle spread functions in 
Eq. (66) and Eq. (67) we consider the scalar diffraction pattern in Eq. (74). 
Thus, the imaging problem is reduced to an imaging problem for coherently 
radiating point sources. 

We note with [ jisl ] that the Rayleigh criterion is not suitable for defining the 
resolution in the case of coherent imaging, because it tends to underestimate 
the minimal distance where resolving two point sources is still possible. 
An acceptable resolution criterion turns out to be, instead, the Sparrow 
criterion, stating that the minimal separation of two points on the OTR 
screen is given by d - 4.6 • A/ da- This value may be compared with that 
given by the Sparrow criterion for incoherent radiation, i.e. d - 3.0 ■ A/ ©a. 
Such comparison suggests that incoherent imaging techniques would yield 
(in the case of point emitters) an ultimate resolution about 1.5 better than that 
in the coherent case. However, as explained before, techniques to improve 
the particle spread function in the incoherent case are limited by the number 
of photons available. 

Finally, it should be clear that the definition of resolution is somewhat 
arbitrary, as it consists in a single number, a figure of merit to be extracted 
according to some criterion or algorithm from the actual image. A more 
quantitative approach to the resolution problem would be to use the CTF 
representation, where the point spread function is given in terms of its 
spatial Fourier transform. 

Since one has 
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coda, 



(75) 



one recovers the result that the pupil behaves like a spatial filter. Spatial 
frequencies up to Oaf A are allowed to pass through the aperture without 
distortion, while higher frequencies are blocked. The resolution of the image 
of the electron bunch reduces to the standard diffraction-limited resolution 
of a point source. 



Let us consider, as a numerical example, our particular case of an XFEL 
with an ORS setup. Here we are dealing with coherent imaging. As shown 
above, an acceptable resolution criterion for the coherent imaging case is the 
Sparrow criterion, which defines the resolution as d - 4.6 • A/Sq. According 
to the Sparrow criterion, for our case, i.e. for A = 800 nm and 6 a = 0.1, we 
obtain d - 6 /.tm. 



It should be noted that both Sparrow and Rayleigh criteria can only be 
applied in the case of a Fraunhofer diffraction pattern from a lens aper- 
ture. They cannot be directly applied when one deals with a complicated 
OTR particle spread function like that in Eq. (65). Therefore, up to now, 
the diffraction-limited resolution for OTR imaging has been based on the 
estimation of an "effective width" of the OTR particle spread function. 



For example, in order to estimate the diffraction-limited resolution for the 
incoherent OTR imager, authors of [ jsl ] fit the particle spread function, Eq. 
(65), to a Gaussian function and calculate the rms width of the Gaussian 
fit. They conclude that, for A = 800 nm and Oa = 0.1, the particle-spread 
function given by Eq. (65) has a width of about 6 /.tm, and they consider this 
as a measure of the diffraction-limited resolution for their system. However, 
using their method, one concludes that the resolution for a point source is 
about three times better than what is predicted by the Rayleigh criterion, 
which is given hy d - 3.8 • A/ da- In fact, a resolution d - 1.2 ■ A/ da was 
reported iri [34] for the incoherent case and for a point-source. For example, 
following 134(1 , one would obtain a resolution d — 2 [im , while according to 
the Rayleigh criterion the resolution is d - 5 /,im. The difference is due to 
the use of the width of the point spread function to define the resolution of 
the system in |34jl . 



The main result reached in this Section is an optimization of the particle 
spread function of the system, so that the imaging problem can be reduced 
to the usual (coherent or incoherent) imaging theory for point-like radiators. 
This result is summarized in Eq. (74), presenting the amplitude particle 
spread function of the system after processing, which is equivalent to the 
amplitude point spread function. 
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4.7 Comparison ofOTR and undulator sources for electron bunch imaging 

We have seen that OTR pulses are intrinsically difficult to deal with, due to 
the presence of the polarization singularity of the OTR field from a single 
electron, resulting in the space-variant polarization and in the singular be- 
havior of the field amplitude. These difficulties can be avoided for instance 
if one uses coherent radiation emitted by a modulated electron bunch in the 
radiator-undulator of the ORS, where the polarization is space-invariant, 
and the field amplitude has no singular behavior. Yet, coherent undulator 
radiation is not suitable for imaging applications, while coherent OTR is. 
In fact, coherent undulator radiation is highly coUimated within an angle 
VA/Ljy, La, being the undulator length, which in our case is smaller than 
a milliradian. Therefore, effectively, the numerical aperture of the imaging 
system is limited by the divergence of the undulator radiation, defining the 
resolution of the system. As a result, an electron bunch of a tens-micron- 
size cannot be resolved with the help of undulator radiation. In contrast to 
this, the OTR radiation pulse from a single electron has a non-limited an- 
gular distribution, and the resolution will only be limited by the numerical 
aperture of the optical system. This basic characteristics of OTR plays an 
important role in both coherent and incoherent imaging. 



5 Coherent OTR imaging for characterizing electron bunches for XFELs 

In the previous Section we have shown that, due to the particular features 
of the OTR source, the OTR image of a single electron significantly deviates 
from the image of a point source. This fact causes problems concerning 
the measure of the charge density distribution. We demonstrated that the 
resolution can be improved with the help of a polarization transformer 
and a specific transparency in the Fourier plane at the cost of reduction in 
photon density. Techniques described in the previous Section can improve 
the OTR image from a single electron up to the standard diffraction pattern 
from a point source. We shall now discuss some practical aspects which 
are specifically related to the characterization of electron bunches in an 
XFEL system, such as 3D electron bunch structure, signal-to-noise ratio, 
aberrations and alignment of mask and polarization transformer. 

5.1 Electron bunches with spatio-temporal coupling 

The intensity profile in the image plane is readily calculated. In our case of 
interest radiation is coherent. Therefore, we must convolve, in the space- 
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Fig. 22. Normalized horizontal and vertical emittance as a function of the longitu- 
dinal position within the electron bunch taken from fs']. The dashed line indicates 
the peak-current profile. 
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Fig. 23. Vertical (left) and horizontal (right) projections of the electron bunch density 
distribution, [24]. 

frequency domain, the field given in Eq. (73) with the Fourier transform of 
the electron density distribution p{a), r), and take the squared modulus to 
get the energy radiated per unit spectral interval per unit surface 



dcodS 



dr'p(a;,r'-rO-/il — 



(76) 



When dealing with XFEL applications, it is important to realize that the 
electron-bunch slice emittance changes along the bunch, as one can see 
from the example shown in Fig. 22, which refers to the European XFEL [3]. 
Note that even if the slice emittance were constant, some mismatch is always 
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Fig. 24. Effect of the bandpass optical filter on recording the bandwidth-limited 
optical replica pulse by a single detector pixel. 

present, leading to a dependence of the electron bunch transverse size on 
the longitudinal coordinate (see Fig. 23). A comparison of Fig. 22 with Fig. 
23 shows that even in the part of the electron bunch where the emittance 
does not vary much there are significant changes in the transverse size. 
As a matter of fact, XFEL beam formation systems will generate electron 
bunches that exhibit a coupling between space and time even in the case 
of nominal operation. As a result, optical replica pulses will also exhibit 
coupling between space and time, i.e. the electric field components in the 
space-frequency domain cannot be f actorized as a product E{<jo,r) - f{f)g{<jo) . 

The detector automatically performs an integration over time of the intensity 
I{t,ri) in the space-time domain, which can be extended over all times, 
because the detector is not fast enough to resolve a single radiation pulse. 
Such integration over times is equivalent, due to Parseval's theorem, to 
an integration over all frequencies of Eq. (76). Therefore, the detector will 
record the energy per unit surface 



det 



dS 



oo 

(fi) oc J" ^^Ij" df^p{co,r' -ri)A{co,r'] 



(77) 



Let us define po{t, r), as the electron density distribution of the unmodulated 
electron bunch. In the case of a bunch modulated with frequency Wm, poih ^ 
and p{t, r) are linked by 
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Fig. 25. Schematic diagram of coherent OTR imaging of electron bunch without 
bandpass optical filter. 
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Fig. 26. Schematic diagram of coherent OTR imaging of electron bunch with band- 
pass optical filter. The setup with bandpass optical filtering can be used to recon- 
struct the (x,y) electron bunch projection. 



p{t,r) = po{t,r)[l + afCos{cOmt)] , 



(78) 



where flj is the modulation level. Let us also call Pq{(jJ, r) the Fourier trans- 
form of po{t, r) with respect to time. 
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Fig. 27. Setup of high-resolution coherent OTR imager with bandpass optical filter 
to reconstruct the (x,y) electron bunch projection. 

Since the adiabatic approximation applies, the function p(a;, r) has a narrow 
bandwidth t^cojoj « 1. Thus, we can write pioJ,r) = pQ{a} - oj„„r)af, a/ 
being the modulation amplitude. Note that here we assume Uf = const, an 
assumption justified in Section 2, where we discussed that the main goal of 
the ORS setup is to produce an exact optical replica of the electron bunch. 
Since po{t,r) is a positive function and Uf is constant, the optical pulse is 
automatically bandwidth limited^ 

The situation is described^ in the time domain by Fig. 24 (top) and Fig. 25. 
The fact that p{co,r) = po{A(jj,r)af , where Aco = co - cOm, is implied by the 
validity of Eq. (78) with = const. Note that iiuf were not constant, p{a), r) 
would be a convolution of po{Aco, r) with the Fourier transform of uj. 

In Fig. 24, the envelope of the optical replica pulse seen by a single pixel of 
the detector is shown as a function of time. This function is integrated by 
the detector pixel over a given temporal interval, which is longer than the 
duration of the pulse. As shown in Fig. 24, the result is proportional to the 
time-integrated squared modulus of the amplitude of the pulse envelope. In 



In order to ensure a bandwidth-limited optical pulse we need that the seed laser 
be bandwidth-limited along the electron-bunch pulse. However, even a bandwidth- 
limited seed pulse can yield, in practice, to a non bandwidth-limited electron den- 
sity distribution. This is the case e.g. when there is a given energy chirp within the 
electron bunch. We will discuss this situation in detail later on. For now, we put 
attention that Eq. (79) is actually based on the use of a weaker assumption than 
af = const, namely \af \ = const. 

Notations in figures are slightly different compared to those in the text for editing 
reasons. However, all relevant definitions are included. 
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its turn, the pulse envelope can be written as a convolution between po{t, r) 
and the point spread function AicOm, where the point spread function 
is calculated at co — cOm because the adiabatic approximation holds. The 
detector thus records 



dS 



{Vijoc J'^i J dr'pQ(t,r' -ri)A{cOm,r') 



(79) 



which is equivalent to Eq. (77) by means of Parseval's theorem. Neglecting 
the blurring one simply obtains 



det 



dS 



iri)oc{pl)^ j dt\po{t,-rif . (80) 



The right hand side of Eq. (80) is an integral in time of the squared modulus 
of the charge density distribution, i.e. it is a quadratic projection, along the 
temporal axis, of the charge density distribution. 

Note that in the case of incoherent OTR imaging (still neglecting the blur- 
ring) one would sum the intensity of all electrons independently. As a re- 
sult, in order to obtain the energy absorbed by a detector pixel at a given 
transverse position f), we need to sum over all the electron arrival times. 
Therefore, the signal from each pixel will be proportional to the integral in 
time of po(i, ri), i.e. it is a linear projection of poit, ri) along the temporal axis. 

Both linear and quadratic projections can be taken advantage of. They can 
be used for comparison with results of numerical simulations and they are 
both sensitive to perturbations of the transverse size of the bunch, which 
we need to monitor. 

However, also in the coherent OTR imaging case it is possible to obtain the 
linear projection of the charge density distribution, instead of the quadratic 
projection, by inserting a narrow-band optical filter with bandwidth much 
smaller than the inverse duration of the optical pulse, and centered at the 
modulation frequency co,,, . Calling with Q{co - cOm) the filter transfer function, 
one obtains the following modification to Eq. (77) 



dS 



00 

(fi) oc J ^^\J dr'p{co,r' -ri)Q{co - cOrn)A{co,r') 



(81) 

EXie to the narrow-band nature of Q, and remembering that due to the 
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adiabatic approximation p{a), r) = po{a) - cOm, r)af, both p and /i can be taken 
out of the integral in dco, thus yielding 



dW, 



dS 



dco \Q{co - C0rn)\^ • 1 drpo{0, f - ri)A{co„, r'] 



(82) 



In the time domain, the envelope of the optical replica pulse (see also Fig. 
24) coincides with the non-modulated electron density distribution, po{t, f). 
One has 



po(0,n) = J dtpoihrd. (83) 

— oo 

Note that po(0, ri) is a positive function, in contrast with the complex- valued 
function po{Aa},fi} for Aco + 0. One sees that the detector just records the 
squared modulus of linear projection (in time) of the charge density distri- 
bution. 

The same conclusion can be reached directly in the time domain by inspect- 
ing Fig. 24 (bottom) and Fig. 26. The optical-replica pulse is stretched by 
the optical filter to a duration of 1 / (Ao;) » Xp, with Xp the duration of the 
original optical replica pulse. The result of this operation is another pulse 
whose envelope is given by the convolution between the envelope po(0 
and the Fourier Transform of Q. Since l/(Aa;) » Xp, the detector pixel sees 
the integrated intensity, that is just given by Eq. (83). A possible setup for 
high-resolution OTR imaging for XFEL setups including a bandpass filter 
is shown in Fig. 27. 

In closing, it is interesting to note that in the incoherent case one sums up 
intensities from single particle contributions. For example, in the case of 
the usual incoherent OTR setup in Fig. 10 (with no image processing), the 
energy per unit surface in the image plane is given by: 



^pymcoh 
get 

dS 



oo 



(84) 



which already presents a linear projection of the charge density distribution. 
From Eq. (84) we see that, in the incoherent case, the spectral energy density 
per unit surface is proportional to p(0, ri) (neglecting the blurring), i.e. to 
the Fourier transform of the charge density distribution at zero frequency. 
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This follows from an average over ensembles. In Eq. (84), p(0, ri) may also 
be presented as p{0, ri) = j p{t, r)dt. Then, based on Eq. (78), which is valid 
in general, due to adiabatic approximation we have J p{t, r)dt = j po{t, r)dt, 
for arbitrary modulation strength fly^(i, r). 



5.2 Effect of aberrations on the -performance of coherent optical systems 

When an optical system acts as a linear, space-invariant filter for a complex 
field amplitude, it is referred to as isoplanaticH^ When a coherent optical 
system is isoplanatic, the effect of aberrations on its performance can be 
treated within the framework of a standard theory However, in general, 
even aberration-free optical systems are not isoplanatic. Some spatial phase 
distortion is unavoidably introduced, which severely modifies the intensity 
distribution of the image. 

In section 4 we discussed the effect of non-isoplanatism on coherent image 
formation. We demonstrated that thin aberration-free lenses can only be 
considered as isoplanatic system if limitations (46) are satisfied. Here we 
discuss the effect of non-isoplanatism on coherent imaging of extended 
objects when the thin lenses composing the optical system are no more 
aberration-free, and we discuss conditions when the standard isoplanatic 
theory can be applied. 

Consider Fig. 28. Aberrations, i.e. wavefront errors, can be imagined as 
equivalent to a phase-shifting plate located at the aperture, which deforms 
the wavefront. The complex amplitude transmittance Tir') of the phase- 
shifting plate is given by 



T{?) = exp[zO(f*)] , (85) 

where 0(f*) characterizes the aberration. As before, we indicate with P(r') 
the pupil function characterizing the lens aperture. The complex function 
P(r')T(r*) is referred to as the generalized pupil function. 

When an aberrated optical system is isoplanatic, the amplitude point spread 
function of the system is space-invariant, and is simply given by the Fraun- 
hofer diffraction pattern of the generalized pupil function P{r')T{r^), which 
is essentially its Fourier transform. The generalized pupil function itself 

Note that in ||37l]. conditions for isoplanatism are discussed for a simple case 
of an aberration-free thin lens with a finite extent. We did not find, in literature, 
any discussion about conditions for isoplanatism for lenses with aberrations and 
apodization. 
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phase-shifting plate that 
accounts for aberrations 




P(r') =1 at r'< a 
P(r')=0 at r'> a 
T(r') = exp[iO(r')] 
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Fig. 28. Isoplanatic system in case of aberrations. When wavefront errors exist, we 
can imagine that a phase-shifting plate is placed at the aperture, thus deforming 
the wavefront. When the characteristic scale of the wavefront errors a' is much 
larger than a single Fresnel zone, and the object scale o is much smaller than a' , 
the optical system with aberrations is isoplanatic. The amplitude transfer function 
concept can be applied to such system and the generalized pupil function plays the 
role of the amplitude transfer function. In this case, left and right configurations 
are equivalent. 

plays the role of the amplitude transfer function for the system. In this case, 
the spatial bandwidth of the amplitude transfer function is limited by the 

finite pupil aperture P(r')- The effect of aberrations consists in the introduc- 
tion of phase distortions within the bandwidth of the amplitude transfer 
function. These phase distortions can have a severe effect on the fidelity of 
the imaging system. 

Conditions for an aberrated optical system to be isoplanatic are easily de- 
rived on the basis of Section 4.2. Let us call with a' the characteristic scale 
of the wavefront error. Suppose now that a' is much larger than a single 
Fresnel zone, and that the object scale o is much smaller than a'. In this case, 
the reasoning in Section 4.2 can be repeated replacing conditions (46) with 



a « fl' , fl'V(/t/) » 1 . 



(86) 



Then, both Eq. (44) and Eq. (50) can be recovered, and the optical system is 
isoplanatic. 



More generally, geometrical aberrations are described by the Siedel wave- 
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front representation ||46(]: 



W, e) = ho (^) + ho (^) + hi Q cos(0) + hi (^) cos\d) , (87) 

where &20/ ^io> ^31 and ^22 are the amounts of defocus, spherical aberration, 
coma, and astigmatism. The quantities r' and Q are defined through 



r'^ = {x'f + {y'f , e = tan-\x'/y') . (88) 

For the sake of illustration, it is interesting to discuss a particular example for 
the case of defocusing aberrations. In this case it is customary to introduce 
a quadratic phase error O(r') in the form 



(89) 



In Eq. (89), the characteristic scale a' is linked to the aberration strength by 
a''^ ~ a^/b2o- Therefore, from the second condition in (86), one has ho ^ 
a^/{Af), which means that ho should be much smaller than the number 
of Fresnel zones on the aperture. Similarly, from the first condition in (86) 
one has ho ^ cP-/o^, meaning that the aberration strength should be much 
smaller than the ratio between the aperture area and the object area too. Note 
that these conditions actually set criteria for the system to be isoplanatic in 
terms of aberration strength, diffraction, and geometrical dimension of the 
source compared with the pupil aperture a. Finally, note that in general, for 
an aberration of order n, the aberration strength should satisfy the conditions 
bl^m « ^jV(A/) and bl'^l « a^o^ 

In our case of coherent OTR imaging, the impact of chromatic aberrations 
on the system resolution can be avoided by taking advantage of the narrow 
bandwidth of the optical replica radiation pulse. 



5.3 Sensitivity to displacement of mask and polarization transformer away from 
the focal plane 

Up to now we discussed two cases: a pupil consisting of a finite aperture 
only, and a pupil consisting of a finite aperture with aberrations. 

In the case of a pupil without aberrations we saw. Fig. 13, that conditions 
(46) guarantee that the system is isoplanatic. The field in the Fourier plane 
is the product of the Fourier transform of the field at the object plane and 
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systems with inverse apodized pupil 
do not meet isoplanatic condition 

pupil aperture 




-a a r 



Fig. 29. Introduction of a mask in the pupil. The problem is related to the so called 
inverse apodization. In this case the generalized pupil function P{r)T„{r) has no 
characteristic size and, generally speaking, the optical system is non-isoplanatic. 
For coherent imaging of extended object some spatial phase distortions is intro- 
duced, which severely modifies the low (spatial) frequency components of the 
object spectrum. 

the pupil function (see Eq. (47)). Based on Eq. (47), we demonstrated that 
the position of the aperture can be located anywhere between lens and focal 
plane. 

This reasoning was extended to the case of aberrations in the previous Sec- 
tion 5.2. There we saw. Fig. 28, that the same conclusions can be drawn 
whenever conditions (86) hold, which have the same mathematical struc- 
ture of conditions (46), with the only difference that they now include the 
characteristic scale of the wavefront error a' and not the size of the pupil 
aperture a. 

Both the problems that we discussed are related to coherent image formation 
theory. This knowledge can be applied to treat related technical issues. 

The goal of this subsection is to deal with the problem of alignment of 

optical elements along the optical axis. As was shown in Section 4.1, Fig. 
13, a lowpass spatial frequency filter under conditions (46) is insensitive to 
the position of the filter (which can be located anywhere between lens and 
focal plane positions). 

It is now a good time to generalize our considerations, and to examine the 
sensitivity to displacement of a Fourier mask of particular interest. The mask 
that we want to consider has a profile given in Eq. (70), Fig. 29. This profile 
changes the shape of the optical transmission of the optical system. Such 
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changing is known in literature as apodization. Our case is actually known 
as inverse apodization (see e.g. fs^) because it suppresses the central part, 
and not the periphery of the optical signal. 

Considerations made above will lead us to find isoplanatic conditions for a 
pupil with inverse apodization. The pupil function in this case given by the 
product P{r)T„i{r). 

We start with Eq. (44) assuming that the second of conditions (46) is satisfied. 
This time, we analyze Eq. (44) in a different way. Expressing F(0, it) in terms 
of a Fourier integral, neglecting the quadratic phase factor in because of 
the second of conditions (46), and integrating in duwe obtain 



£(2/,r-}) = ^ / dr'^^(0>r'^)P(^f-^'^)Tmirf-r)exp 



icor' ■ Yf 



(90) 



Now E ^0, r' j is our object, whose extension we estimate with the character- 
istic size a. 

We begin by analyzing the case without mask, i.e. Tm{r) = 1, using Eq. (90). If 
also the first of conditions (46) is verified, o ^ a, then we can have o^/ (Af) ~ 
1, meaning that the pupil is in the near zone. Since r' < o (otherwise the 
field at z = under the integral vanishes), due to the exponential factor in 
Eq. (90), one is actually limited to the range Tf < a. Then, P{ff - r') = 1, 
because \ ff - r'\ ^ o <^ a. Therefore, the pupil function P can be taken out of 
the integral sign in Eq. (90), meaning that it does not perturb the image of 
the object. In the case a^/{Af) « 1, since r' < a, the exponential term in Eq. 
(90) is of order unity for maximal values ofr^ ~ Af/o » o. Now the pupil 
operates as a cutoff filter, introducing a little blurring of the sharp edge on 
a scale o ^ a. 



Now let us turn back to the case a mask with inverse apodization is present. 
We can appreciate the difference compared to the previous case. When 
o^/{Af) ~ 1 we cannot take the generalized pupil function P{r)Tm{r) out 
of the integral sign anymore, because on the scale tj o (determined by 
imposing that the exponential term in Eq. (90) be at most of order unity) we 
have significant variation of the generalized pupil function through T,„(r), 
which does not present any characteristic scale of interest. When instead 
o^l{Af) « 1, we have the characteristic scale tf ~ Af la » o. Then, with 
accuracy a^/{Af) we can take the generalized pupil out of the integral sign, 
because for these large values of the result of the integration changes little 
within excursions of r' < o. 
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Fig. 30. In our case of interest, the largest transverse electron-bunch size is small 
enough for its Fraunhofer diffraction pattern to be formed at the pupil plane. As 
a result, the most important part of the object spectrum (mid-range and high spa- 
tial frequencies) meet isoplanatic condition. For this range the generalized pupil 
function plays the role of amplitude transfer function. Perturbations of low fre- 
quency components of the object spatial spectrum can only modify the intensity 
distribution on a scale of much larger size compared to the bunch size. 

A qualitatively different result is reached, because low frequencies are al- 
ways perturbed, while in the case of a simple pupil P{o) = 1 for cr «: a. It is 
therefore impossible to apply the concept of amplitude transfer function to 
the system in Fig. 29. In other words, the system is not isoplanatic. 

Note that inverse apodization is discussed in literature only in relation with 
incoherent imaging. If we were imaging an incoherent collection of point 
sources the problem of isoplanatism would not exist at all, and the auto- 
correlation function of the pupil actually would serve as transfer function 
of the system. However, this observation applies to an object consisting of 
an incoherent collection of point sources only, and not to the case of in- 
coherent OTR imaging L where we deal with an incoherent collection of 
laser-like sources. In this case, each source has a finite transverse extent, and 
the problem of isoplanatism exists exactly as in the coherent case. 

Clearly, as soon as we consider inverse apodization for coherent imaging, 
the situation becomes complicated. Nevertheless the isoplanatic conditions 
(86) can be satisfied for mid-range and high spatial frequencies of the object 
spectrum. Fortunately, in the XFEL case perturbations of the low spatial 



This example is not unique, think e.g. to the case of Synchrotron Radiation 
sources. However, the OTR case is more complicated due to the polarization sin- 
gularity of the source. 



69 



far zone asymptotic limit arbitrary 
positions 



Fresnel diffraction effect 
behind the lens 



masic PT 




IL 

210 



f 



object 
plane 



lens 



Fourier 
plane 



Fig. 31. When the transverse size of the electron bunch is small enough for its Fraun- 
hofer diffraction pattern to be formed at the pupil plane, mask and polarization 
transformer can be installed at arbitrary position within the focal distance. 

frequency components lead to the introduction of some background only, 
but not distortions in the electron-bunch image (see Fig. 21). 

Considering the specifics of our problems as illustrated in Fig. 30, we are 
interested in spatial frequencies higher than 1 /cr, where o is the largest size of 
interest in our object. Thus, we are not interested in spatial frequencies lower 
than 1 /cr, which are modified by a shift in the mask position. This reasoning 
yields an effective scale a' on the inverse apodized pupil of order a' ~ A//cr. 
Such scale can be used in conditions (86), which are now equivalent to 



When condition (91) is satisfied, i.e. when the lens is in the far zone with 
respect to the object structure of size o, conditions (86) are satisfied too, and 
one can shift the mask from the focal plane without appreciable effects. 

The far-zone condition (91) means that the transverse size of the object 
is much smaller than the size of the Fourier pattern. Intuitively, as it can 
be seen from Fig. 31, when placed in the far-zone, the lens transforms a 
spherical wavefront from a point source into a plane wave. When the object 
has some transverse dimension o, such that a^lKf <ic 1, one can neglect 
Fresnel diffraction effects at the mask placed anywhere between pupil and 
Fourier plane. Thus, with accuracy cr^/A/, the wavefront behind the lens 
can be still considered plane and does not change at any position between 
lens and focal plane. 

As a result, in the far zone, there is a very low sensitivity to mask and 




(91) 
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Fig. 32. Segmented half-wave plate with octonary sectors. Arrows schematically 
show the direction of the fast axis of the crystal in each sector. 

polarization-transformer displacement from the focal plane. 



5.4 Sensitivity to transverse displacement for mask and Polarization Transformer 



With the help of a polarization transformer and a Fourier-plane mask, as 
schematically illustrated in Fig. 18, we have seen in the previous Section 
that spatial spectrum and polarization of OTR radiation can be conveniently 
processed. 

A schematic representation of the polarization transformer is shown in Fig. 



32. The polarization transformer demonstrated in | j43|, l44j1 is composed of 
eight 1431] up to twelve | |44|l sectors of half -wave retardation plates, each one 
with different orientation of the crystal's "fast" axis. The linear dimension of 
the device is of the order of a centimeter. A photonic crystal segmented half- 
wave-plate is the perfect radial-to-ltnear polarization converter for coherent 
OTR imager. For a twelve segments device, the linear polarization purity 
can be as high as 99%, with transformation efficiency of about 92%. Due 
to the good quality of the photonic crystal segmented half -wave-plates, the 
incident wavefront is negligibly perturbed. 

Both mask and polarization transformer are endowed with rotational in- 
variance. Their centers of rotation should coincide with the position of 
the optical axis, where the center of the Fourier transform is located. The 
accuracy needed in the alignment of the Fourier-plane mask and the po- 
larization transformer is related to the fact that we wish to obtain a proper 
image within region B in Fig. 7, i.e. r, < 100 jUm in the image plane for a 
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focal distance / ^ 30 cm. This maximal value of r, corresponds to a region in 
the focal plane given by > A//r, - 400 /im. Within this accuracy, the elec- 
tron bunch image will not be sensitive to transverse displacements of the 
center of the mask in the Fourier-plane and of the polarization transformer 
with respect to the optical axis. This estimation also specifies the required 
manufacturing accuracy, close to the center of rotation, of the polarization 
transformer and of the mask in the Fourier-plane, which should be in the 
range of about 100 [im. 

5.5 Signal-to-noise Ratio 

The signal-to-noise ratio of our system is strictly related to the practical 
resolution achievable. In fact, one has to make sure that the number of pho- 
tons available are enough to allow implementation of the imaging scheme 
described above. 

As an example, let us consider the case when the detector is a CCD camera. 
Supposing that the CCD charge capacity is about 2 -r 4 • 10^ electrons per 
pixel, and assuming a quantum efficiency of about 50%, one concludes that 
the CCD camera will saturate at about 10^ photons per pixel or more. 

For our particular case of coherent OTR imaging, the estimated resolution 
according to the Sparrow criterion is about 6 [im. Because of the Nyquist 
sampling theorem, the CCD pixel size should be about 3 [im. A typical CCD 
pixel size is about 10 x 10 ^m^. Thus, to resolve 6 |Um features in the electron 
bunch, the magnification of the optical system should be about 1 : 3 or 
more. Choosing for simplicity the magnification 1 : 3, our zone of interest 
(region B in Fig. 7) will result in an image size of 300 X 300 [im^ in the image 
(detector) plane, i.e. 30 x 30 pixels. Similarly, the electron bunch density 
distribution (region A in Fig. 7) of order of 30 [im is imaged into an area of 
about 100 X 100 fim^, i.e. 10 x 10 pixels. 

Assuming 10^^ photons in the region of interest of the imaging system 
(region B in Fig. 7), we obtain about 10^ photons per pixel. The difference 
of four orders of magnitude beyond the saturation level of the CCD can 
be used to implement the particle-spread function improvement described 
in the previous Section, which may be quite expensive in terms of photon 
budget. 

Obviously, in any case, we should make sure that no pixel on the CCD 
camera gets more than 10^ photons. From Poisson statistics follows that the 
shot-noise level at saturation is about a fraction of a percent of the signal, 
i.e. the signal-to-noise ratio is a few hundreds. Shot noise is the dominant 
cause of noise at high photon-level conditions. However, while the signal 
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decreases, other sources of noise become important. As the photon level 
drops, also the shot noise level diminishes, and the CCD camera tends to 
become read-out-noise limited. Assuming a read-out noise e.g. of order of 
10 electrons, a certain pixel operates in read-out-noise limited regime when 
the number of photons incident on that pixel is of order 10^ or less. At 
this photon level the shot-noise and the read-out noise are comparable. The 
signal-to-noise ratio is of order 10, and quickly decreases as the number of 
photons decreases. The dynamical range of our device is therefore of about 
4 orders of magnitude, i.e. the ratio between the full-well capacity and the 
read-out noise. 



6 3D electron-bunch imaging from diffraction intensity measurements 
by use of iterative phase-retrieval methods 

Up to now we proposed to process the particle spread function of the OTR 
imager in order to improve the overall resolution of the system. We proposed 
to use coherently generated radiation but, from a fundamental viewpoint, 
techniques described in the previous Section maybe implemented for in- 
coherent radiation as well. The only problem using incoherent radiation 
is linked with the smaller number of available photons, compared to the 
coherent case, which makes the discussed technique unpractical. In other 
words, up to now we took advantage of the coherent nature of OTR in the 
sense that we exploited the large number of available photons. 

The fact that we are dealing with coherent radiation, however, opens up 
new imaging possibility. Namely, the detector can be directly installed in 
the Fourier plane, and imaging from diffraction intensity measurements can 
be enforced. Once the Fourier transform of the object is known, the recovery 
of the image is reduced to the calculation of an inverse Fourier transform. 
However, one can only measure the modulus of the Fourier transform, 
and a phase-retrieval problem arises. With their algorithm [15], Gerchberg 
and Saxton demonstrated that numerical solutions to the two-dimensional 
phase retrieval problem is possible. A number of phase-retrieval algorithms 
were developed based on the original Gerchberg-Saxton algorithm. For the 
interested reader we refer to extensive reviews on the subject ||47 t l-| |5ltl . 

Two issues need to be mentioned, which may affect the usefulness of the 
iterative phase-recovery method for our case. The first is the problem of 
uniqueness of the reconstructed object, the second is the speed of the algo- 
rithm, which may not be fast enough to perform coherent OTR imaging at 
an acceptable repetition rate. 

In the following we will study two cases when object and propagated field 
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Fig. 33. Lens-based setup for diffractive imaging of an electron bunch. 

are related by a Fourier transform. In the first case, the detector is placed in 
the focal plane of a lens. In the second, it is placed in the far (Fraunhofer) 
zone of the propagated field. The lens setup allows one to control the image 
parameters, but it can potentially introduce aberrations. The lensless setup 
instead, is defined by the position of the detector only. 



6.1 Lens-based setup for diffractive imaging 



6.1.1 Deconvolution by Fourier-plane mask 

Let us consider the setup in Fig. 33. A lens is used to obtain the Fourier 
transform of the object in the focal plane. Once the squared modulus of the 
Fourier transform of the object is recorded, the object can be reconstructed 
with the help of an iterative phase-retrieval algorithm. 

Let the object be placed in the front focal plane of the lenj^ The field 
across the back focal plane is given by Eq. (47), where we neglect lens 
imperfections, and assume that conditions (46) are satisfied. By means of 
the convolution theorem, the (spatial) Fourier transform of the field in the 
object plane, which enters Eq. (47), is given by the product of the Fourier 
transform of the single-particle field and the Fourier transform of the charge 
density distribution, g{oj, it). Note that here we are discussing a 3D Fourier 
transform, both with respect to time and space. In other words, g{a), it) is the 
3D Fourier transform of p{t,r): 



20 



If not, a phase correction needs to be accounted for. 
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Fig. 34. Reconstruction of an object from the modulus of its Fourier transform by 
using an iterative phase-retrieval method. For the electron-beam imager problem 
the a-priory constraint is the object nonnegativity. For this support phase retrieval 
is much easier and almost always unique. The method Vk^ill open up the possibility 
of real-time, wavelength-limited optical imaging of electron bimches. 



g{a), it) = drdt p{t, r) exp[zr • it ] exp[za;i] 



(92) 



Accounting for the vectorial nature of the OTR jfield, Eq. (47) can be written 
as 



P{rf) 



(93) 



The intensity in the Fourier plane is thus given by 



Q\(x}, 



(OYf 



\Pirf)f 



(94) 



The processing of the intensity is better explained in Fig. 34, where the part 
of the scheme after the lens is zoomed-in. The detector is installed in the 
Fourier plane. 
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Fig. 35. Structure of the coherent OTR radiation pattern as observed in the Fourier 
plane. Parameters are as in Fig. 7. 

The structure of the coherent OTR radiation pattern as observed in the 
Fourier plane is shown in Fig. 35. We are interested in the intensity distri- 
bution in the Fourier plane, in contrast to the previously discussed cases 
when we were interested in the intensity distribution in the image plane. 
The situation is much simpler: in fact, Eq. (94) consists of the product of a 
single-particle factor, the squared electron bunch structure factor, and the 
squared pupil function. This is due to the fact that the polarization of the 
coherent OTR field in the Fourier plane is radiaf^, with respect to the op- 
tical axis, exactly as the polarization of a single-electron OTR field. This 
property greatly simplifies the image processing, because we do not need a 
polarization transformer to deconvolve the electron bunch structure factor 
in the Fourier plane. 

Separation of the squared modulus of the spatial Fourier transform of the 
charge density distribution from the squared modulus of the spatial Fourier 
transform of the amplitude particle spread function maybe done at software 
level, by taking the ratio of the experimental data with the (known) squared 
modulus of the spatial Fourier transform of the amplitude particle spread 
function. An alternative possibility is to insert a filter in the Fourier plane, 
as shown in Fig. 33. The mask to be inserted has the same characteristics 
discussed in the previous Section 4.5.2. Note that here we are considering a 
range of tj such that Yjlf » This is our range of interest because the 
electron beam dimension Oy « yK (see Section 3) is reciprocally related to 
the high spatial-frequency limit, so that the relativistic factor y cancels out in 



For this to be the case, we need a uniform screen, i.e. our mirror has an aperture 
larger than yK. 
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Eq. (94). Therefore, Eq. (94) is independent of y, and the mask transmittance 
is given by Tm{r) in Eq. (70). 

Note that here we implicitly assumed that a bandpass filter is used, as 
discussed in the previous Section, to generate a linear integral of the signal. 
Since such filter is tuned at cOm, the intensity in the detector plane behind 
the mask and the bandpass filter is 



I{a)m, Tf) = const • 



Q\CO„ 



= const • a 



go\0, 




(95) 



where (see Eq. 78) g{a)m,u) = afgo{0,u) = Uf j_^dt(t>{t,u), where O is the 
spatial Fourier transform of po{t, r). The main difference to be remarked here, 
with respect to the discussion in the previous Section, is that for diffractive 
imaging one needs a bandpass filter, while in the previous Section one could 
choose between a linear or a quadratic integral projection of the signal. In 

fact, without a filter, one would record ^ d{ls.a))\QQ{ls.a),u)^, and it would 
be problematic to reconstruct the structure of the electron bunch in real 
space. On the contrary, once \QQ{Q,a)m?fl{cf))f- is measured, the solution of 
the phase retrieval problem yields back po(0, 7) - dtpo{t, r). Note that the 
filter should have a bandwidth smaller than the bandwidth of po{Ka), r). 



6.1.2 Phase-retrieval problem 

Once the squared modulus of the spatial Fourier transform of the electron- 
bunch projection along the time axis, \go{0,oj,nrf/{cf))\^, is obtained, one 
deals with a typical phase-retrieval problem. In fact, in order to reconstruct 
the object one needs to know both amplitude and phase of the diffraction 
field. However, the only directly-measurable quantity in the Fourier plane 
is the intensity, which is proportional to the squared modulus of the field. 
Determination of the missing phase from the observed intensity is known 
as phase-retrieval. In our case, must be obtained from the knowledge of 
l^oP- Only when the phase of go is known one can retrieve the object po(0, r). 

The phase retrieval is typically based on an Iterative Transform Algorithm 
(ITA) fisll , |47]-|51]. This cycles between real and reciprocal space, enforc- 



ing known constraints. In the reciprocal space, the known constraint is, 
naturally, the modulus of the Fourier transform of the object. 

In the real space, some generic constraints are usually granted too. First, any 
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physical object is square-integrable. Moreover, in real space, a finite support 
can be defined for any optical system. The support is defined as the set of 
points over which the object is nonzero, a finite support meaning that the 
object is nonzero only within a finite region of space. The shape of the object 
is usually required to be known in order to set the support constraint in the 
ITA. This support is typically larger than the actual boundary of the object, 
in which case it is said to be loose. A method for finding an estimate of the 
loose support from the support of the autocorrelation function may be used. 
In fact, the autocorrelation of the object can be calculated from the intensity 
distribution in the Fourier plane. Hence, the support of the autocorrelation 
is known. Then, from the support of the autocorrelation one can determine 
upper bounds on the support of the object, i.e. the loose support. 

Once the loose support is constructed, the initial phase input to the iterative 
algorithm can be a random phase set. Iteration by iteration, non-zero elec- 
tron density outside the finite support is gradually pushed to zero. In the 
reciprocal space, the measured magnitude of the Fourier transform of the 
object is enforced at each iteration. 

Since only the modulus of the Fourier transform is measured, the uniqueness 
of the solution of the ITA is a central question. When the object is complex, 
reconstruction is not trivial. Although, usually, the ITA gives good results, 
the uniqueness of the solution is not granted. However, when an object is 
endowed with the property of nonnegativity, the application of iterative 
methods is drastically simplified. 

Note that when our object is a temporal projection of po, i.e. po{0,r) = 
^ dtpQ{t,r), we are dealing with a positive object, because po is obviously 
a real positive function. 

Besides the non-zero electron density outside the finite support, also the 
negative electron-density inside the support is pushed to zero, iteration 
after iteration. The ITA will easily converge to the original object in the case 
of a two-dimensional phase-retrieval problem for positive images. 

When more a-priori information about the object is known, reconstruction 
becomes easier. In case a low-resolution image of the object is available, that 



image can be used as a constraint for reconstruction, and allows the ITA |50|] 
to work much faster. 



This suggests the idea of combining, as illustrated in Fig. 36, incoherent OTR 
imaging and diffractive imaging in a single technique. One may register, as 
before, the squared modulus of the Fourier transform of the charge density 
distribution with a coherent imager while, at the same time, a low-resolution 
image of the bunch may be taken with an incoherent imager. Then, the low- 
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Fig. 36. Reconstruction of an object from the modulus of its Fourier transform using 
the phase-retrieval method. The object domain constraint is a low-resolution optical 
image. Under such support, the method will open up the possibility of real-time, 
wavelength-limited imaging of electron bunches. 

resolution image can be used as a-priori information in the reconstruction 
process, allowing real-time, diffraction-limited resolution. 



6.2 Lensless setup for diffractive imaging 



An alternative application of diffractive imaging is a lensless setup. Actually, 
lensless imaging is one of the most promising techniques for microscale 
imaging of an electron bunch. The high-momentum vectors available in the 
Fraunhoffer diffraction plane can yield, in principle, wavelength-limited 
resolution without limitations which apply to lenses. 



6.2.1 Validity of the Fraunhoffer approximation 

In Fig. 37 we describe a setup where one takes advantage of free-space 
propagation of the electric field to the object plane. Ideally, detection should 
be performed in the Fraunhofer diffraction region. However, in practical 
cases, the diffraction pattern from the charge density distribution overlaps 
with the OTR halo, whose diffraction pattern may still be in the near zone, 
at the position of the detector plane. 



With reference to Fig. 7, specific regions on the OTR screen are related to 
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Fig. 37. Simplest lensless setup for diffractive imaging of the electron bunch when 
the detector is placed in the Fraunhofer diffraction plane. 
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Fig. 38. Radiation intensity pattern in the detector plane for a lensless diffractive 
imaging setup with a metallic mirror as an OTR screen. The problem is related to 
the fact that the OTR halo is not small enough for its Fraurihofer diffraction pattern 
to be formed in the detector plane. As a result, the Fraunhofer diffraction pattern 
of the object can only be viewed with significant interference from the OTR halo. 



characteristic scales from tens of microns up to a millimeter. The far zone 
condition reads as (A/a^)z » 1, where o is the characteristic object size and 
z the position of the observation plane. For small features of cr - 10 jUm in 
region A, within the electron density distribution, the far-zone condition 
requires z » 1 mm, but for the largest features in the halo region D we 
have o - yA, and for E = 17.5 GeV one needs z » 100 m to reach the far 
zone, which is obviously unfeasible. In other words, when one deals with 
the diffraction pattern from region D, the observation plane is in the very 
near zone. 
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Fig. 39. Image processing with an object plane mask for lensless diffractive imaging 
setup as possible solution of OTR halo problem. 

This fact leads to overlapping of the Fraunhofer diffraction patterns of re- 
gions A and D, as shown in Fig. 38. In fact, consider a characteristic value 
cr ~ 1 mm for the very near zone, and o — 10 jUm for the far zone. For the 
very near zone field, the size at the observation plane in the order of 1 mm 
(basically, the unvaried value of o). For the far zone field one may estimate 
that the size in the detector plane is the coherent angle A/cr ~ 10"'^ multiplied 
a typical value z ~ 10 cm. As a result, the size of the far-zone field at the 
observation plane is also about 1 mm, hence the superposition of the two 
diffraction patterns, which makes difficult to practically realize the lensless 
diffractive-imaging setup. 

A straightforward way to overcome this problem is to manipulate the distri- 
bution of the scattered field. Actually, the problem is solved when the OTR 
halo is suppressed. A square lattice of high-reflectivity metal disks with lat- 
tice parameter a can be formed on the OTR screen. The array should cover 
the region of interest only (region B in Fig. 7), which is smaller than the halo 
region. In this way, the electron bunch is practically sampled by the lattice of 
disks. The lattice parameter should be small enough, that when an electron 
bunch impinges on the screen one has d <^ a <^ o^, being the transverse 
rms size of the electron bunch, and d the disk diameter. For example, an 
array of small disks of about d = 1 [im diameter, with separation of about 
a = 5 [im maybe chosen, as indicated in Fig. 39. The (Fraunhofer) diffraction 
pattern from each disk can be estimated as Az/d ~ 10 cm. Therefore, it will 
only be limited by the aperture. In other words, with a resolution given by 
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the numerical aperture of the system, we can consider these disks behaving 
like 6-functions, compared to the electron-bunch scale. 



The Fraunhofer diffraction pattern of the sampled electron bunch has now 
a much larger dimension compared with the original one. Note that the 
implementation of this technique requires no optics, and does not impose 
stringent requirements on the detector. 

The field in the Fraunhofer plane is given by the far-zone expression 



where F{0,u) is the spatial Fourier transform of £(0,r) (see Eq. (40)). Thus, 
assuming that we are sampling the field at the OTR screen at positions fk, 
the sampled field in Eq. (96) amounts to 



The problem rising due to the manipulation of the OTR screen is significant. 
For the lensless setup we have I{co, f) proportional to the squared modulus 
of Eq. (97). In Eq. (97) we perform a Fourier transform of the electric field at 
the OTR screen with a fixed boundary, that is the array frame. Even without 
sampling scatters (imagine in place of the scatter array e.g. a rectangular 
screen with high reflectivity and size of the order of the region B) one has 
a complicated diffraction pattern for a single electron due, first, to the fixed 
frame profile and, second, to the vectorial properties of field. As a result, 
the single electron diffraction pattern cannot be deconvolved anymore from 
the 2D Fourier Transform of the charge density distribution. 

In this situation we should first reconstruct the field distribution on the 
OTR screen. The complexity of the problem is now linked with the vectorial 
nature of the field. The squared modulus of Eq. (97) actually consists in the 
sum of two diffraction intensity patterns from two orthogonal polarizations, 
and one should separately reconstruct both polarizations on the screen, 
i.e. field amplitude and direction. The problem can be equivalently stated 
considering a complex representation of the field as Ex + iEy. The squared 
modulus of Eq. (97) is the diffraction pattern from such field. Thus, our 
problem is to recover the complex-valued function Ex + iEy. 

Note that Ex,y > 0. This is strongly related to two facts. First, within the 
Ginzburg-Frank theory the electric field from a single electron at the OTR 




(96) 




(97) 
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screen is not complex-valued. Physically this means that, in this approxima- 
tion, the OTR field is characterized by a plane wavefront on the OTR screen. 
Second, the field on the OTR screen is a convolution of the single-electron 
field with a positive function, the temporal projection of the electron den- 
sity distribution, i.e. poiO,r). In this case, the reconstruction of the complex- 
valued function + iEy directly presents information about the vectorial 

properties of the field, because the fact that + iEy is a complex-valued 
function is only related to such vectorial nature. 



6.2.2 Strong support constraint based on tailoring of the OTR screen reflectivity 

As usual in phase retrieval problems, a-priori information about the object 
greatly enhances the efficiency of the iteration algorithm. 

In our case, we have a very well-defined boundary consisting in the sharp 



rectangular corners I of the sampling grid. The grid is placed within the 
region B in Fig. 7, meaning that all electrons are within the sampled area, 
and that the sampled area is completely illuminated, up to the boundary 
of the OTR screen. Thus, the edges of the grid-frame, i.e. of our object, are 
guaranteed to have a good contrast. This a-priori information is extremely 
useful as concerns the ability of reconstructing the electric field at the object 
plane. 

It is worth to restate this concept for the case described in Eq. (97). The 
solution of the 2D phase retrieval problem is unique, while that of the 
ID problem is not. Note that the Fourier transform of the sampled object 
can be expressed in terms of a polynomial. In particular, Eq. (97) is a 2D 
discrete Fourier transform, i.e. a 2D polynomial. The lack of uniqueness of 
the solution for the phase-retrieval problem is equivalent to the f actorability 
of this polynomial. Because of the Fundamental Theorem of Algebra, the 
ID phase-retrieval problem is known to present ambiguities. In contrast 
to this case, since polynomials in two variables are only rarely factorable, 
2D sampled objects usually present a unique solution to the phase-retrieval 
problem. 

Nevertheless, the reader should realize that the lensless imaging method 
stands on less solid ground, with respect to the lens-based setup discussed 
in the previous Section 6.1. In that Section, we based our technique on 
well-known methods that can always be applied, without any problem, 
in the case of non-negative objects. In other words, proposals considered 
there are based on already experimented techniques, and will surely work 



•^^ Note that the sharpness of this corners is assumed to be diffraction limited. In the 
optical wavelength range modern lithography can easily satisfy this requirement. 
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without difficulties. In the present case we outlined a novel technique. We 
recognized a problem related to the fact that the OTR halo is not small 
enough for its Fraunhofer diffraction pattern to be formed in the detector 
plana ^•^ I As a result, the Fraunhofer diffraction pattern of the object (i.e. 
of the electron bunch) includes significant interference from the halo. We 
sketched a possible solution to this problem, but we did not go deep into 
models and simulations. 



6.2.3 Retrieval of the electron density distribution from the electric field 

The last step in our method consists in retrieving the charge density dis- 
tribution from the knowledge of the electric field. To be specific, since, as 

before, we used a bandpass filter centered at the modulation frequency cu^, 

-» 

the electric field reconstructed by the iterative algorithm is E(a;,„,r). 

The Fourier transform of the charge density distribution at a; = Wm (i-e. 
p{a)m, ^ = afpo{0, r)) is related to the field by the Gauss law, and therefore 



po(0,f)cxV-E(a;,„,r). (98) 

Outside the electron bunch, the electric field has vanishing divergence. 

The validity of Eq. (98) is strongly related to our analogy of the single- 
electron field with the field from a wire. Such analogy is valid (see Eq. 
(9)) for r <?c yA and, in particular, for t < Or yA, which is within the 
bunch region. Then, our object can be expressed as a convolution between 
the charge density distribution (that is, within the electrostatic analogy, the 
wires distribution) and the electric field of each wire, scaling as 1 /r. 

Finally, it should be stressed out that the grid of scatters can be considered 
as a fixed reference frame. This allows to measure with high accuracy the 
absolute size of the electron bunch and the absolute position of its center of 
gravity as soon as the absolute position of the grid is known. 



There is no fundamental problem related to the far-zone plane. Difficulties only 
arise on practical grounds here due to z ~ 10 cm. 
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6.3 Technique to characterize the 3D electron bunch structure with a multi-shot 
measurement 



6.3.1 Combination of real and reciprocal space imaging spectrometers 

Up to now we used coherent OTR imaging techniques to obtain (directly or 
after solution of a phase retrieval problem) a projection of the charge density 

^ dtpQ{t, r). Here we will discuss how it 
is possible to combine diffractive imaging and direct imaging techniques to 
obtain the 3D electron bunch structure. 



Very recently in |14l] a combination of diffractive and direct imaging to obtain 
the spatio-temporal structure of a cylindrical-symmetric optical pulse was 
proposed and demonstrated. In this case, the 3D problem reduces to a 2D 
phase-retrieval problem. Then, a spectrometer with a dispersive element 
could be used, rather than a plurality of bandpass filters which would 



have been necessary in the 3D case. The Gerchberg-Saxton algorithm |15|] 
could then be used to retrieve the phase along the spatial direction, and 
an independent FROG measurement at a fixed transverse position allowed 
phase renormalization of the phases between different frequency slices. 

Here we propose to use a similar idea for the full 3D problem. Our setup 
records two 3D "cubes" of spectral data from the image both in real space 
and reciprocal (i.e. spatially Fourier-transformed) space. These 3D cubes of 
information include two spatial dimensions and one spectral dimension in 
both domains. 



The technique is not based on a single-shot, but rather on a multi-shot 
measurement. The obvious requirement is, of course, the reproducibility of 
the 3D electron bunch density from shot to shot. 

A possible setup for the implementation of our method is sketched in Fig. 
40. The OTR signal is split by a semitransparent mirror into two parts, for 
the diffractive imaging, and for the direct imaging. The only difference to the 
setups described above for direct and diffractive imaging is that, now, the 
central frequency of the bandpass filter can be changed with high repetition 
rate, up to 10 Hz, that is the macropulse repetition rate of the European 
XFEL. 

Due to the adiabatic approximation, the diffractive imaging setup records 
the distribution \go{Aa),it)\'^, where w = a)m^fl{cf), and Acu is, as usual, the 
shift of the central frequency of the bandpass filter from a;,„. The direct 
imaging setup records |po(A(x', r)p instead. Thus, by scanning through Ao; > 
0, we obtain information about the modulus of the spatial Fourier transform 
of the charge density distribution and about the modulus of the charge 
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Fig. 40. Combination of real and reciprocal space imaging spectrometers for char- 
acterizing the 3D electron density distribution by use of multishot measurements. 

density distribution. Both these distributions are recorded in the temporal- 
frequency domain, at different frequencies Ao; ^ 0. 

Note that, for Acu + 0, the quantity po is riot real. In other words, for each 
value Acu, our setup records information about the modulus of the temporal 
Fourier transform of the electron density distribution and its spatial Fourier 
transform, but misses all information about phases. We are once more con- 
fronted with a phase retrieval problem, where we know \qq{I^co,u)'^ , and 
|po(A(i;,/^P, and we want to know po (or, equivalently, ^o)- The object to be 
reconstructed is now complex-valued, but we have strong a-priori informa- 
tion in both real and reciprocal space. 

The Gerchberg-Saxton algorithm [15], which was the first practical iterative 
algorithm to solve the Fourier-phase-retrieval problem, constitutes a way to 
achieve the recovery of the phase from two intensity measurements in the 
image and in the Fourier planes in an imaging system. Since the introduction 
of the Gerchberg-Saxton algorithm, numerous variations have been studied 
1 47, 4^ 51] and may be possibly used instead of the original one. 



The Gerchberg-Saxton procedure gives amplitude and phase in r (or u) for 
each frequency. The algorithm is applied only along the spatial dimension, 
so that each frequency slice is independent of the other. In other words, 
there is a random phase jump between each frequency slice. Yet, if we 
want to perform an inverse temporal Fourier transform, we need to have 
information about the relative phases between different frequency slices. 
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The setup in Fig. 40 cannot provide this information. It should be presented 
as a-priori knowledge. 

Fortunately, this information is available from undulator optical replica 
synthesizer (ORS) measurements [8]. As discussed before, from the FROG 
trace we can reconstruct amplitude and phase of the Fourier transform of 
the longitudinal bunch profile. As a result, if we integrate the charge density 
distribution over both transverse coordinates along a given slice, we can use 
the ORS data as reference point for the phase and obtain the renormalized 
phase distribution within the slice. Then, phases acquire sense, not only 
relative to other points within the same slice, but also relative to each point 
in other slices. We can then obtain the full 3D electron bunch structure by 
performing a final temporal Fourier transform. 



6.3.2 Dijfractive imaging spectrometer. Projection algorithm. 

The technique described above takes advantage of both real and reciprocal 
space imaging spectrometers to reconstruct the 3D complex-valued Fourier 
transform of the electron density distribution. The major difficulty in apply- 
ing this technique is that the 3D cubes of available spectral data are sliced at 
different frequencies, thus yielding go{/^oJ, 0Jm^f/{cf)) in the reciprocal space, 
and poCAcu, r) in the real space. When Ao; ^ 0, the object in real space, po, is 
complex. Once the phase problem is solved with the help of the Gerchberg- 
Saxton algorithm and reconstruction is done slice by slice, one still needs 
already available data to renormalize the relative phases between different 
slices. 

Another possible approach to obtain the full 3D electron bunch structure 
for a multi-shot measurement without these drawbacks makes use of a 
diffractive imaging spectrometer alone, as shown in Fig. 41. Such setup allow 
to record the 3D cube of spectral data \go{AiXi, com^f / {cf))\^ , which consists of 
less information compared to the technique described above. 

The method takes advantage of a different way of postprocessing the avail- 
able data. For example, one can slice |^o(Aa;, at Aco = in the {Ux, Uy) 
plane, aXUx-O in the (Acu, Uy) plane, or at My = 0, in the (Ao;, Ux) plane. 

The advantage of slicing the available spectral data in this way is clear if one 
remembers that the iterative reconstruction algorithm aims to reconstruct 
e.g. po(0, X, y), from the knowledge of \qq{Q, Ux, Uy)\^. As we have seen before, 
the quantity po(0, x, y) is just the projection of pQ{t, x, y) along the temporal 
axis, i.e. 
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Fig. 41. Diffractive imaging spectrometer for characterizing the 3D electron density 
distribution by use of multishot measurements. 

po(0, X, y) = {po{t, X, y)>j = J" dtpQ{t, x, y) , (99) 

where we introduced <po(^,x, y))^ as the temporal projection of po. Now 
we use the same relation between projections and inverse two-dimensional 
Fourier transforms. In other words, the projections (po(i,x, y));^y of po along 
the X axis, or the y axis, defined by 



<po(f,x,y)>^ = J" dxpo{t,x,y) (100) 

and 

<po(f,x,y)>y = J dypo{t,x,y) , (101) 

are found by solving the phase retrieval problem for \go(Aco,0,Uy)\^, or 
\go{Aco, Ux, 0)p. An example of how such projections should look like, based 
on start-to-end simulations, is given in Fig. 23. 

One concludes that the objects to be reconstructed are, in these cases, real 
and non-negative. As we have seen before, this fact drastically simplifies 
the use of iterative reconstruction algorithms granting quick convergence. 

We are thus capable of recovering the three orthogonal projections of po 
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along t, X and y. One may also recover other projections as well. For example, 
suppose that, instead of cutting the spectral data at = (or Uy = 0) we 
cut it along another line, passing by the point Ux — Uy - but tilted of an 
angle a with respect to the Ux axis in clockwise sense. This means that we 
are simply performing a rotation 

u'^ = Ux COS a + Uy sin a 

u'y = -Ux sin a + Uy cos a . (102) 

With respect to the new system {Leo, u'^, u'y), we are considering the cut 
|^o(A(i', u'^, 0)p, from which we can recover (po(^/ ^' , y'))y" which is the (non- 
negative) projection of po along the axis y' , where 

x' = X cos a + y sin a 

y' = -xsina + y cosa . (103) 

Obviously, we can choose whatever angle a we like in the {Ux, My) plane, but 
also in the (Acu, w^) plane, or in the (Acu, Uy) plane. As a result we can retrieve 
any projection of po(i, x, y) on any plane in the (t, x, y) space containing any 
of its axis. More in general, in our 3D cube of data there exists a very special 
point where all three frequencies Acu, Ux and Uy are zero. We may consider 
any 2D plane passing through this point and reconstruct any projection in 
the real space-time domain under the positivity constraint. 

This information is sufficient to recover the 3D function po(f, x, y). 

The method that we just proposed is a multi-shot technique operating in 
the 3D Fourier domain, recording a 3D cube of information (the intensity 
in the 3D Fourier domain), and reconstructing the 3D real space-time elec- 
tron density distribution through projections, which are recovered, in their 
turn, by reducing the phase-retrieval problem to the well-known 2D phase- 
retrieval problem under the nonnegativity constraint of the object. We name 
this technique "multi-shot FRODI" (Frequency-Resolved Optical Diffractive 
Imaging). 

6.4 FRODI technique to characterize the 3D electron hunch structure by use of a 
single-shot measurement 

In this Section we use the new technique Frequency-Resolved Optical Diffrac- 
tive Imaging (FRODI), which was introduced above, for 3D imaging of an 
individual, ultrashort electron bunch. Using a diffractive imaging spectrom- 
eter based on a wavelength-dispersive element, this technique measures a 
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Fig. 42. A schematic of FRODI, an apparatus capable of measuring three orthogonal 
projections of the electron density distribution for a single ultrashort electron bunch. 
This will allow retrieval of the full 3D structure of the electron bunch. 



2D spectrum of a single pulse. FRODI involves mapping the spatial fre- 
quency onto the transverse position, so that the relevant transverse spatial 
coordinate becomes the spatial-frequency axis, e.g. Ux at Uy = 0. This can be 
accomplished with the help of a slit in the Fourier plane centered on the 
optical axis. The temporal frequency is the other spatial coordinate, so that 
a detector can record the trace of a single pulse. In the previous Section 6.3 
we saw that retrieving e.g. the projection {x, t) of the electron bunch from 
the trace {Ux, co) is equivalent to the solution of a 2D phase-retrieval prob- 
lem under non-negativity constraint. FRODI allows measurements of the 
3D structure of a single electron bunch. This is accomplished by splitting 
the optical beam and simultaneously measuring the orthogonal projections 
(x, t), (y, t), and (x, y). The respective 2D traces can be obtained on three 
separate detectors. 

Thus, the method of projection considered above can be successfully ap- 
plied in the single-shot case as well. The overall idea is to acquire a set 
of projections of the electron bunch in the (cu, Ux, Uy) space from a single 
shot, and to apply reconstruction techniques to retrieve the electron density 
distribution po{t, x, y). 

The main difference compared with a multi-shot measurement is that we 
do not acquire the full spectral cube of information through the multi-shot 
measurements, but only a few projections, those necessary to successfully 
take advantage of tomographic reconstruction methods. 
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Fig. 43. Beam geometry for measurements of the (x,t) electron bunch projection. 
The prism-lens combination constitutes a spectrometer. 
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Fig. 44. Slit geometry for measuring different projections of the electron bunch with 
a single-shot 3D imager. 

A possible realization of the single-shot FRODI is shown in Fig. 42. The OTR 
radiation is redirected into three (or more) stations, where required slices of 
the 3D spectral cube of information are recorded. In the scheme in Fig. 42 
we consider the three slices at Ux = 0,Uy = and Aco = 0. 

The projection along the time axis is recovered from the slices at Ao; = 0. The 
setup used for this operation is the usual diffractive imaging spectrometer 
in Fig. 34, where the bandpass filter is tuned at Aco = 0. 

The projections along the x and y axis are recovered from the slices at M;^ = 
and My = 0. In order to obtain a two-dimensional trace in the {Aco, Uy) plane 
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(or (Ao;, Ux) plane) one needs a wavelength-dispersive element capable of 
encoding Aco into the spatial transverse direction. This can be accomplished 
with the help of a setup like that in Fig. 43. The usual Fourier mask is placed 
in the Fourier plane, followed by a slit to select the slice at M;^ = or Uy = 0. 
In principle, one may select other slices by tilting the slit at a given angle, as 
shown in Fig. 44. This is the equivalent of choosing a given angle a in the 
discussion given in the previous Section about the projection method. After 
the slit, the optical signal passes through a dispersive element, e.g. a simple 
prism, and the 2D trace in the (Ao;, Uy) or {Aco, Ux) plane is subsequently 
recorded by a detector. 

All 2D traces are then used as an input for iterative retrieval algorithms, 
yielding back the projections of the charge density distribution po{t,x,y) 
onto the (x,y), the {t,x), and the {t,y) plane. Tomographic reconstruction 
follows from the knowledge of these projections. 

As noted before, if more projections are needed one may extend the scheme 
in Fig. 42 including slits with different orientations. The problem of charac- 
terizing the 3D structure of an electron bunch from a single-shot measure- 
ment can now be solved by standard reconstruction techniques typical of 
tomography. 

The FRODI concept proposed here is part of a new generation of electron- 
beam diagnostic techniques based on coherent optical light. It does not 
need any kind of synchronization, is based on single-shot measurement, 
guarantees micron resolution, and operates in real time. 

6.5 Numerical processing of the data recorded in the Fourier plane 

Demands on the detectors for the diffractive imaging case are qualitatively 
different from those for direct imaging. In the case of diffractive imaging, a 
certain structure of size a on the object is related to a structure of size A//cr in 
the Fourier plane. Consider e.g. a focal distance of a few tens of centimeters. 
The smallest features to be distinguished on the object, of order of a few mi- 
crons, correspond to structures of order of a centimeter in the Fourier plane. 
Similarly, the largest features, of order a few hundreds microns, correspond 
to structures of order of a hundred microns in the Fourier plane. 

Now, the irradiance on the detector decreases with 1 /r^ as we move from 
the center to the periphery, due to the 1/r behavior of the spatial Fourier 

transform of the single-particle field. As the photon density decreases, the 
relative importance of shot-noise increases, i.e. the signal-to-noise ratio gets 
worse. In particular, since shot-noise follows Poisson statistics, we expect 
a degradation of the signal-to-noise ratio as S/N oc 1/r. A way of avoiding 
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this degradation is to bin the detector pixels properly (or to use a ad hoc 
engineered detector with custom pixels of different shape and area). Sup- 
pose that we bin the detector by first dividing the available area in circular 
sectors and then dividing again each circular sector in many radial slices of 
a given thickness. In this way, the area of each bin would increase quadrat- 
ically with the radius, thus compensating the decrease of photon density. 
The number of photons per bin can be made constant by a proper binning. 
As a result, the signal-to-noise ratio linked with shot-noise would not be 
subject to degradation anymore. 



7 Measurement of the full spatio-temporal charge density of ultrashort 
electron bunches with Fourier transform holography 

A further natural development of our imaging techniques consists in the 
introduction of a known reference wave, transforming the previously dis- 
cussed diffractive imaging setup into a setup capable of recording Fourier 
holograms. Since only one Fourier transform is required to obtain digital 
reconstruction, Fourier transform holography (FTH) is far superior over all 
iterative phase retrieval methods in terms of computation time. 

A concept of a FTH setup is sketched in Fig. 45 (see fs^). We summarize here, 
for future use, the working principle of Fourier holography. Let us define the 
electric field in the object plane in the space-frequency domain 0(0), r) for 
the object, and R{co, r) for the reference. Later on we will see that in our case 
of interest complications are introduced by the space-variant polarization of 
the electric field in the object plane. However, for now we assume that O and 
are scalar amplitudes, i.e. amplitudes of a fixed electric-field component. 

In the Fourier plane we define 0{a),rf) and 1i{a),rf), where 0{a),rf) = 
j dr 0{co,r) exp[ico / {cf)r ■ Vf] is the Fourier transform of O, and 'R is sim- 
ilarly defined. The electric field in the Fourier plane is given by the sum 
P ■ 'R + P ■ O, where P is the pupil function of the optical system, and we 
consider the system isoplanatic as before. Since the pupil is considered as a 
simple circular aperture of radius a, the pillbox function P has the property 
= P as before. A detector will record a holographic pattern I{co, Tf) given 

by 

I{co, f}) P(r7) \nco, r-}) + 0{co, ^'ff = Pi^f) |^R(a;, fff + P(r7) \0{cv, ^f)f 
+P{?f)K{co,?f)-0{co,?f) 

+P{rf)n{w, r-» • 0\w, r-» . (104) 
An interference pattern is thus recorded in the Fourier plane. Such pattern is 
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Fig. 45. Optical system used to record a Fourier hologram. 

called a hologram. Inspection of terms in Eq. (104) shows that the hologram 
is a superposition of the squared modulus of the Fourier transform of the 
object, of the squared modulus of the Fourier transform of the reference, 
and of interference terms. The squared modula carry no extra-information 
compared to what can be obtained from diffractive imaging techniques: 
information about the phase of the Fourier transform of the object is not 
included there. Such information is included, however, in the interference 
terms. Actually, it is fair to say that Fourier holography works because the 
detector is a nonlinear device, yielding the squared modulus of the input 
amplitude, which includes interference terms. 

The field distribution of the reference wave is known. Consider Fig. 45 and 
suppose, which is the case for our setup, that the width of the reference 
source is of order of the optical wavelength of interest. Then, the refer- 
ence source can be considered as a point source. It follows that "Rico, rf) = 

exp[ico/{cf)(i ■ Vf], j6 being the vector position of the reference. Substituting 
the expression for K in Eq. (104) and taking the inverse Fourier transform 
of the holographic pattern, one obtains 



?f)\ (r) cx \p{y^f) \R{w, ?f)\ J (r) + T-' [P(r-» \0{w, ?f)\ J (r) 
^ dr'Vico, r + j3 - f) • 0{co, f) 
J dr'P{a),r- ^-f)-0*{a),r') . (105) 



+ 
+ 



Thus, with accuracy given by the dimension of the pupil aperture, the in- 
verse Fourier transform of the holographic pattern yields straightforwardly, 
through the interference terms, a direct image of the original object O, and 
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Fig. 46. Practical setup for Fourier transform holography using a virtual reference 
point source. 

a conjugate image O*. Note that such direct image is actually a convolution 
of the object with a specific point spread function. In fact, we always have 
blurring due to diffraction effects. 

The object and the reference source must be arranged with a separation 
larger than twice the object width (this is called the FTH condition). Then, 
by inspection of Eq. (105) follows that the object is well-separated from the 
second term of Eq. (105), which amounts to the autocorrelation function of 
the object. 

For optical applications, the resolution of holographic techniques is not 
limited by size and quality of the reference source. With the help of modern 
lithography it is not difficult to produce an unresolved point source at optical 
wavelengths and let sufficiently bright radiation through it. 



7.1 Real-time high resolution FTH imaging of electron bunch 



A big advantage of using FTH techniques compared to diffractive imaging 
methods explored in Section 6, is that no iteration algorithm is needed. Thus, 
real-time, high-resolution imaging of the electron bunch is granted through 
FTH. 

A way a FTH setup may be implemented is shown in Fig. 46. It can be seen 
to consist of the previous diffractive imaging setup shown in Fig. 33 together 
with a reference source. The reference source is constituted by a screen with 
a reference pinhole illuminated by a laser pulse. The laser pulse should 
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Fig. 47. Sketch of the polarization for object and reference wave in the hologram 
(Fourier) plane. 

be synchronized to the electron bunch with sub-picosecond accuracy. The 
object wave and the reference wave are summed up with the help of a 
semitransparent mirror before the lens. 

Note that, as was discussed before in Section 6, and for the same reasons, 
a narrow bandpass optical filter centered at the modulation frequency is 
needed before the detector. 



7.1.2 Laser with circular polarization as reference source 

A specific aspect of our case study, compared to usual FTH setups, needs 
further investigations. As we mentioned before, we derived Eq. (105) under 
the assumption that O and R are scalar objects. In our case, however, the 
field distribution of the object wave in the Fourier plane is radially polarized 
independently of the charge density distribution, as shown in Fig. 47. Note 
that the object field distribution exhibits a complicated space-variant (not 
radial) polarization. 

Since the setup is designed to resolve the object, we assume that the size 

of the Fourier pattern of the object in the detector plane is much smaller 

compared to the available numerical aperture (multiplied by /). Therefore, 

the FTH condition is still formulated requiring that the separation between 

the reference source and the object should be larger than at least two times 

-» 

the characteristic size of the object. This means that |j6| should be taken 
larger than the object autocorrelation function in order not to superimpose 
the object image with the central spot. 
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Fig. 48. Schematic representation of digital Fourier hologram recording and its 
numerical reconstruction. 

Space-variant polarization yields difficulties when one tries to apply the 
FTH algorithm in the usual way described above. Namely, in our case, 
interference terms are multiplied by a factor depending on the position in 
the Fourier plane. 

To show this, let us first introduce polar coordinates (p and Tf, so that Vf = 
(Tf COS {(p),rf sin {(p)). Supposing that the reference wave is linearly polarized 

along e.g. the x direction, one has 'R{co,rf) = exp[ico/{cf)fi ■ rf]{l,0) and 

3{co, Tf) - 0{co, ff){cos{(p), sm{(p)). It follows that Eq. (104) is modified to 



+P{rf) exp 
+P{ff) exp 



0{(o, Yf) cos{(p) 
0*{co, ff) cos{(p) . 



(106) 



In this case, taking the inverse Fourier transform of Eq. (106) would not give 
back Eq. (105), due to the specific kind of position-dependent factor cos ((/)). 
The problem is that cos{(p) becomes zero for certain values of (p, and can be 
filtered out at the expenses of a reduced resolution only. 

A possible solution consists in using a circularly polarized reference wave, 
instead of a linearly polarized one. Fig. 47. In this case, one has 9^(0), Tf) = 
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exp[za;/(c/)jS • rf]{l,i). Then, Eq. (106) changes to 



+P{rf)exp — 7jS • Tf 



0(a), ff) exp[i(p] 



+P{rf) exp 




0*{a),rf)exp[-i(p] . 



(107) 



Eq. (107) includes a position-dependent factor as well. However, such fac- 
tor never becomes zero, and it can be easily filtered out improving the 
resolution. A vortex phase mask exp[-i(p] can be applied. Fig. 48, for im- 
age processing purposes. Note that one may choose whether to process the 
signal digitally, or to use additional hardware for this purpose. The same 
remark applies to further image processing described in the next section as 
well. To fix a particular case study we assume, for example, that the vortex 
mask is included into the numerical image processing algorithm. Applying 
the vortex mask one obtains 

I{cl), Tf) exp[-i(p] oc P{rf) exp[-i(p] + P{rf) \0{co, fy)|^ exp[-i(p] 



After taking the inverse Fourier transform of the holographic pattern in Eq. 
(108), the object 0{a), r) will appear again from the third term in Eq. (108). 

Because of the extra phase exp[-i(p], which is now present in the first terms 
of Eq. (108), the Fourier transform will be blurred, but its size cannot be 
visibly changed because the object is assumed to have much larger size 
compared to the point-spread function. In fact, before multiplication by the 
vortex mask, the Fourier transform of the object, O, is automatically limited 
by the pillbox function P, due to a finite numerical aperture of the optical 
system, which is unity for values of r^// smaller than the numerical aper- 
ture, and zero otherwise. This pillbox function actually acts as the Fourier 
transform of the point-spread function of the optical system. Due to the 
presence of the vortex phase mask, the pillbox function is multiplied by 
the extra position-dependent phase-factor exp[-i(p], and the point-spread 
function of the system is now given by the Fourier transform of the prod- 
uct of the pillbox function by exp[-i(f)]. Note that here we cancelled the 
position-dependent phase-factor for the direct image term, thus obtaining a 
non-blurred image of the object, at the expense of blurring both the autocor- 
relation and the inverse image of the object. The analysis in [,53J, where one 



+P{rf) exp 



+P{rf) exp 




0*{a),rf)exp[-2i(p] . 



(108) 
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may jfind plots of the Fourier Trarisf orm of the product of a pillbox function 
and a vortex, allows to conclude that this Fourier transform of the product 
of pillbox function and vortex is just a few times wider compared to the 
point-spread function (which is the Fourier transform of the pillbox only). 
As a result, after convolution with e.g. the autocorrelation function, we have 
practically the same size as before the convolution, and the FTH condition 
holds. 



7.1.1 Mask for optimum FTH imaging of electron bunch 

When we discussed diffractive imaging techniques in Section 6 we saw that 
the input to the phase retrieval algorithm is Eq. (95), which was obtained 
from Eq. (94) through Fourier filtering. Note that Eq. (94) is proportional to 
the squared modulus of the field at the Fourier plane, Eq. (93). In our case 
of interest, the Fourier transform of the object, 0{ff), coincides with Eq. (93). 
The difference with respect to that case is that, now, 0{rf) does not enter as 
a squared modulus in Eq. (108), but only linearly. Therefore, at first glance, 
using the same spatial Fourier filter as in the previous Sections we cannot 
optimize the image resolution anymore. 

However, if we properly introduce another filter after the screen with the 
reference source in Fig. 46, we can optimize the image resolution and still 
keep the previous filter in the Fourier plane. Since the filter after the screen 
should work as a Fourier filter partly countering the contribution of the 
filter in the Fourier plane, it must be placed in the far-zone of the reference 
wave. If we indicate with Zm the distance between the screen and the mask, 
and with d the dimension of the reference source, the far-zone condition 
is d^/{Az,„) «: 1. This is not a restriction. In fact (i ~ A, and the Fraunhofer 
condition will always be satisfied behind a screen with such a pinhole. Then, 
if we define the amplitude transmittance of the mask in the Fourier plane 
Tm{r), which is explicitly given by Eq. (70) with To = 1, we should introduce 
a second mask with amplitude transmittance proportional to T^^(r) after the 
screen with the reference source. Let us name such transmission function 
Tm,2- The mask will be installed at some distance z from the pinhole, in the 
far zone. Due to the far-zone approximation, angular openings 6 are related 
to transverse sizes on the mask by r = dz. Since we are only interested in 
6 > 1/y we can set Tm,2 - for r < z/y. Then, if the numerical aperture of 
the lens is given by NA, one can set T„, 2 = (NA)To,2z/r for r > z/y, with the 
condition {NA)T o^2Z < 1- When this is done, given the input field 




(109) 
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one can rewrite the third term in the pattern in Eq. (108) with the help of Eq. 
(94) as 



I{a), Tf) exp[-Z{^] oc exp 



cf ^ 




P{rf) . (110) 



Note that, as concerns the object signal, we deconvolve the electron bunch 
structure factor in the Fourier plane. One should have a good signal-to-noise 
ratio, which is mandatory for success in the implementation of our scheme. 
Such ratio is related to the manipulation of the reference source, i.e. of a laser 
source passing through two spatial Fourier filters. The method of binning 
discussed in Section 6.5 for increasing the signal-to-noise ratio maybe taken 
advantage of. 



7.2 Self-referencing FTH imaging 



A natural extension of the FTH imaging idea is to use the object wave to 
measure itself. This allows one to image the electron bunch without the need 
for a separate reference source. 

There are several approaches to self-referencing schemes in FTH imaging. 
One version of these techniques makes use of the reference pulse which can 
be obtained from the object pulse itself using spatial filtering. An alternative 
to the scheme proposed in Fig. 46 would be, in fact, to split the optical replica 
pulse from the OTR screen into two, and spatially filter one copy in order 
to obtain a reference pulse, i.e. a quasi-planar (being spatially filtered) wave 



in the detector (Fourier) plane. This wave ^'^ I interferes with the second copy 
of the original pulse at some angle provided by the FTH condition. Of 
course, both pulses should be spectrally filtered with a narrow band-pass 
filter at frequency cOm. Note that in this geometry the object wave, i.e. the 
original optical-replica pulse, should be reflected a few times off mirrors and 
beam-splitters so that both object and reference pulses can be synchronized 
and combined in the detector plane. Additionally, in our case the object 
wave should be attenuated of about two orders of magnitude to increase 
the contrast of the hologram. This kind of self-referencing FTH schemes 
is based on the use of standard techniques and are widely discussed in 



literature, see e.g. p6\ 



Note that this wave has linear polarization, but can now be transformed into 
circular polarization (in self-referencing schemes). Then, numerical processing in- 
cluding the vortex phase in the hologram can be applied as discussed before. 



100 



100 fs electron bunch with 
density modulation at 800 nm 
wavelength 



coherent 
OTR 



SOO nm bandpass — 
optical filter with 
bandwidth 1 nm 

mask 
Fourier plane 




► lens 



Fourier transform 
hologram 



reflection coeficient 
of screen is about 1 



profile of 

electron 

bunch 



CCD camera 
connected to 
computer 



2D Fourier transform 
of the hologram 




□ 



reference 

scatter 
reflection coefficient 
of scatter is about 907, 



Fig. 49. Setup for Fourier holography, where the OTR halo outside of the electron 
bunch profile is used to create a reference wave. 

In this Section we consider a particular realization of self-referencing FTH 
imaging, which is based on the use of the coherent OTR halo to produce a 
reference wave. In this case we need to consider a relatively large size of 
the reference source. Moreover, the reference wave has linear polarization. 
These facts imply the production of a lower-resolution image compared 
to the high-resolution image where an optimized reference source is used. 
However, due to the absence of an external reference source, the holographic 
setup turns out to be the same as for diffractive imaging. Then, the low- 
resolution image can be used in combination with previously discussed 
diffractive imaging techniques. 

In this way, an implementation of FTH schemes only consists in the addition 
of some modification in the OTR screen to an already existing (diffractive 
imaging) setup. Fig. 33. This is an obvious advantage, which opens up e.g. 
the possibility of simultaneously performing diffractive imaging and FTH. 
For example, the original holographic pattern can be processed in parallel 
with Eq. (110). Then, standard phase-retrieval techniques can be applied to 
the second term in Eq. (105). The advantage of the combination of FTH with 
iterative phase retrieval methods in X-ray applications has been already 
demonstrated experimentally in |5^. 



7.2.1 Use of the coherent OTR halo to produce the reference wave 

A straightforward idea to implement an FTH setup for low-resolution FTH 
imaging without the help of an external laser is to take advantage of the 
large OTR halo outside the electron bunch profile on the OTR screen. A small 
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disk with high reflectivity coefficient may be formed on the OTR screen, like 
in Fig. 49, and we also assume that the reflectivity of the entire OTR screen 
is made lower. 



The reliability of the image obtained by holography is an important advan- 
tage of FTH over diffractive imaging. No iterative process is involved in the 
numerical reconstruction of the object image from the hologram. There are 
no convergence and uniqueness problems as in the iterative phase retrieval 
approach. 

However, in all FTH techniques, the spatial resolution limit achievable is 
defined by the size of the reference source. The size of the reference source 
has a strong effect on the reference photon flux too. The smaller the source, 
the smaller the number of photons in the reference signal, the smaller the 
signal-to-noise ratio. Thus, there is a limit to how small the size of the source 
can be. Of course, in case an external laser is used to define a reference, one 
can think of focusing the laser on the reference pinhole, or to increase the 
power of the laser, up to some extent. In the setup in Fig. 49, there is no such 
possibility. Hence we need to consider a relatively large size of the reference 
source. Fig. 49, which implies, in its turn, a lower-resolution image. Such 
low-resolution image can be used, as explained before, as a strong support 
for diffractive imaging techniques, so that the iterative algorithm can run in 
real-time. 

However, it should be noted that since the OTR halo is used to illuminate 
the reference source, one does not deal anymore with a circularly polarized 
reference wave. The electric field in the OTR halo region is radially polar- 
ized. Therefore, the polarization of the reference wave is linear. Its direction 
coincides with the direction j3 of the vector identifying the position of the 
reference scatter with respect to the center of the OTR screen. 

In this case one has "^(w, r^) = exp[/(i;/(c/)jS-ry]j3/j3andC/(a;,r/) - 0{co,rf)rf/rf. 
It follows that Eq. (106) is modified to 



+Pirf) exp 
+P{ff) exp 



0{cj,?f) 



(111) 



Since we want to use diffractive imaging as the main technique here, and 
the FTH only provides a better support for the iterative algorithm, we may 
want to use a mask with amplitude transmittance Tm{r) given in Eq. (70) 
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in the Fourier plane. However, in the case under consideration we cannot 
use an additional mask for the reference source. As a result, we have a new 
situation concerning FTH imaging. After the Fourier mask, the third term 
in Eq. (Ill) reads 



I{co, Tf) oc exp 




P(r» '-^ . (112) 



The factor f5 ■ rf/fi appearing in the interference terms now yields a difference 
compared to the FTH imaging technique described above. 

Taking the inverse Fourier transform of Eq. (112), we see that such interfer- 
ence term will not yield the function p but, rather, the directional derivative 
of p along the polarization direction of the reference signal, given by the 
unit vector ftp, shifted of j6 and properly scaled. This fact can be easily un- 
derstood. First, we remember that, given the function f{rf), its directional 
derivative along the direction ftp is given by 



^.nV^M). (113) 
From Eq. (113) and from the properties of Fourier transforms follows that 

r-' [n^ ■ ff /(r-»] (r) = '-^ftp ■ {vr-' [/] (^/)) . (im) 

Thus, our method foresees the use of a known directional derivative of po at 
Aco = as a support for diffractive image. The iterative algorithm will act on 
the autocorrelation term in the holographic pattern with a strong support, 
together with the positivity condition for po(0, f). This should be sufficient 
to obtain real-time convergence of the iterative algorithm. 

Note that the reference scatter has a high reflection coefficient. In order to 
provide a reflectivity contrast we should actually introduce an attenuator 
for the OTR pulse. When we additionally introduce a mask we might end 
up with an insufficient photon flux in the detector plane. However, starting 
with diffractive imaging setups, we have a choice between two possibilities 
of imaging processing. We can use a transparency with amplitude transmit- 
tance as we discussed before, or we can use numerical processing. In the 
case of Diffractive imaging and FTH with external reference source we have 
such choice as well. Since we already suppressed the OTR screen reflectivity, 
the numerical image processing option is to be preferred. In this case, the 
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Fig. 50. A straightforward extension to the Fourier transform holography technique 
is to introduce multiple reference sources. This method can be used to increase the 
amount of a-priori information used in iterative algorithm. The picture shows 
arrangement of the reference scatters on the vertices of a regular pentagon [55]. 
The arrows show directions of the polarization of the OTR field at the scatters. Two 
redundant images of the bunch appear for each reference source. Five independent 
images of the bunch can be recovered from the hologram and then used as support 
constraint for high resolution diffractive imaging. 

numerical processing can be separately optimized for the FTH image and 
for the DI, which operates with the autocorrelation of the object. 



7.2.2 Multiple reference FTH 

A natural generalization of ideas and observations in Section 7.2.1 consists 
in increasing the amount of a-priori information used in the iterative algo- 
rithm. In reference f5§\, a multiple reference source FTH was used in the 
soft x-ray range to simultaneously record five images of the sample. It was 
demonstrated that averaging these images yields an improvement in the 
image quality due to enhancement of the signal-to-noise ratio. 

Here we still consider the setup in Fig. 49, but with several reference sources. 
The situation is illustrated in Fig. 50 where, for the sake of illustration, five 
references are considered. From each reference source one can reconstruct 
a directional derivative of the object. The a-priori knowledge of the (differ- 
ent) polarization directions of the field at the reference points on the OTR 
screen can be used to retrieve the image from the hologram. One recon- 
structs different directional derivatives. Therefore, the bunch profile can be 
reconstructed. 
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Note that the reconstructed Fourier transform contains the autocorrelation 
of all reflecting structures. Two redundant images of the object appear for 
each reference scatter. One image is the cross-correlation of the reference 
with the object, while the other, its complex conjugate, is located radially 
opposite to the origin. The autocorrelation contains five independent images 
of the object. The center is occupied by the autocorrelation of the object 
and of each reference scatter. Images should tile, and not overlap on the 
autocorrelation. This is realized in the arrangement of the reference scatters 
on the vertices of a regular pentagon. 

It should also be noted that the previously discussed schemes using the 
OTR halo as a reference source are particular cases of self -referencing FTH 
measurements. In this case, the low-resolution image would serve as an 
input for the iterative phase retrieval algorithm, giving to the numerical 
algorithm a starting point closer to the real solution. In such combined 
approach, the larger features of the bunch are first imaged unambiguously 
by holography. By subsequent oversampling phasing, one can try to further 
improve the image resolution up to the diffraction limit. A combined Fourier 
holography-oversampling approach has the advantage that high-resolution 
and real-time reconstruction can be reached simultaneously. 

7. 3 Technique to characterize the 3D electron hunch structure by use of multi-shot 
measurements 

7.3.1 Frequency-gated Fourier transform holography 

In Section 6 we proposed a technique to characterize the 3D electron den- 
sity distribution taking advantage of a diffraction-imaging spectrometer, 
in multi-shot geometry. Here we propose another technique for multishot 
measurement of 3D charge density distribution based on frequency-gated 
Fourier Holography. 

A possible realization of the method is schematically shown in Fig. 51. The 
setup is very similar to that proposed in Fig. 46 for Fourier Holography 
imaging, except for two particulars. First, instead of a relatively long (1 ps) 
laser pulse we should use a 10 fs laser pulse. Second, instead of a fixed 
bandpass optical filter we consider an interchangeable bandpass filter with 
the same 1 nm bandwidth, whose central frequency can be rapidly changed, 
with a repetition rate similar to that of the electron-bunch trains. 

The duration of a pulse AT is related to the spectral range AA (in terms of 
wavelengths) by AT ~ A^/(cAA). The laser pulse is chosen short enough so 
that the reference pulse has a spectral range (about 100 nm) of the order of 
that associated with the minimal time scale which we are planning to resolve 
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Fig. 51. Multishot measurement of the 3D structure of electron bunches by use of 
frequency-gated Fourier transform holography. 

(about 10 fs). Note that we do not have synchronization problem because 
when the combined electromagnetic disturbance from reference and object 
passes through the usual bandpass filter, the filter stretches it yielding a 
wave packet about 1 ps long (for a 1 nm-bandwidth filter). As a result for 
overlapping reference and object pulses on the detector we only need sub- 
ps synchronization. If we now tune the bandpass filter to other frequencies, 
we can perform sliced measurements of po{Aco, r) within the predicted wide 
spectrum of the optical replica pulse due to varying transverse size along 
the electron bunch. 

We should reconstruct the 3D cube of data given by Fourier transforming 
poi^oj, r) with respect to Aco. Note that, for Aco + 0, the quantity po is not real, 
so we should address the problem that we can only reconstruct the relative 
phases in each frequency slice. Yet, if we want perform inverse temporal 
Fourier transform we need to have information about the relative phases 
between slices. Our setup cannot provide this information. It should be 
presented as a-priori knowledge. The relative phases between slices can be 
measured using an undulator Optical replica Synthesizer (ORS) [8]. In other 
words, we assume that we are performing an extra FROG measurement of 
the peak-current profile to correctly set the relative phase of each frequency- 
slice ' 



•^^ It should be stressed that the scheme in Fig. 51 is a purely linear optical scheme 
and does not require intense pulses. We assume that we perform an additional 
FROG measurement over the undulator radiation in the ORS setup [8], to define 
the relative phase of each Fourier component. The undulator radiator in the ORS 
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We can then perform numerically, and with negligible computation time, the 
Fourier Transform of po{Aco, r) with respect to Ao; and have a high resolution 
image. 

We may note here that the possibility of reconstructing complex objects 
like po{AiXi,r) is relatively recent and specific of digital recording only We 
have a possibility to reconstruct taking the Fourier transform of a complex 
object. Because the reconstructed wave field is a complex function, both the 
intensity as well as the phase can be calculated. This is in contrast to the case 
of optical hologram reconstruction, in which only intensity is made visible. 
During many decades, only optical reconstruction of holograms was used, 
and only the intensity distribution of the object was recorded. 

The present technique may be very attractive in practice. Again, problems 
with iterative technique are avoided, and there are no difficulties related to 
synchronization for 3D characterization. The only limitation of this method 
is that it relies on a multishot geometry. 



7.3.2 Spatio-temporal Fourier transform holography 

In the following we discuss an extension of the FTH technique from the 
space into the space-time or, rather, into the space-frequency domain. In 
time-domain holography, the reference pulse is a short pulse. During holo- 
graphic recording, the complex amplitude of the spectral component of 
each signal pulse is stored in a series of fringes, which arise as a result of 
interference with the corresponding spectral component from the reference 



pulse. This spectral holography technique was first proposed in |57 , 
and experimentally demonstrated on the femtosecond time-scale in [59 
A close analogy exists between FTH of monochromatic waves and spectral 
holography of time-varying signals. This analogy results from the similarity 
between the equation describing spatial diffraction and temporal dispersion. 



Spectral holography is also named by others "spectral interferometry" |60|1 . 
It should be mentioned that interferometric stability is not required in our 
case, since this method is based on single-shot interference-pattern measure- 
ments. Spectral interferometry may be coupled with spatial interferometry 



to preserve the spatial-phase information ||61|]. 



In 1620, an extension of spatial holography and spectral interferometry 
into spatio-temporal holography was proposed and experimentally demon- 
strated, allowing for the reconstruction of the full spatio-temporal electric 
field from a single ultrashort laser pulse. Such technique constitutes the 



setup yields energies of about a few tens microjoule per pulse, which is much more 
than the energy from the coherent OTR pulse, and more than sufficient for FROG 
measurements. 
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Fig. 52. Muitishot measurement of the 3D structure of electron bunches by use of 
spatio-temporal Fourier transform holography. 

state of the art as regards the diagnostics of ultrashort optical pulses. 

Authors of [62] deal with the problem of measuring an arbitrary pulse. 
As a result, they only use self-referencing schemes. Therefore, they must 
take advantage of a-priori known information about the reference pulse in 
the temporal direction, meaning that they need FROG measurement of the 
reference pulse after spatial filtering. Since the technique is self-referencing, 
this requires a signal pulse with large energy. We deal, however, with a 
simpler situation. In our case, the optical replica pulse has a duration of 
100 -r 200 fs, so that a laser with about 10 fs-long pulses, which is available, 
can be used as an external reference source. Therefore, there are no strict 
requirements on the energy of the signal pulse. 

A possibility to extend FTH techniques from the space to the frequency- 
space domain is illustrated in Fig. 52. We propose to use an imaging spec- 
trometer with a dispersive element installed in the Fourier plane of the 
coherent imaging system. The slit of the spectrometer is centered on the 
optical axis and can be rotated around the optical axis. In this way, we can 
record a two dimensional slice of data with spatial frequency on one axis 
and spectral wavelength information on the other. 

The reference source can be considered as a point source in the space- 
domain and as "temporal point source", i.e. a "short" optical pulse in the 
time domain. Object and reference source must be arranged in space with a 
separation twice larger than the object width (this is the usual FTH condi- 
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tion) and must be arranged with a separation in time domain twice longer 
than the object pulse duration (this is the temporal equivalent of the FTH 
condition). 

Problems with the polarization of the object are solved in the same way as 
for the frequency-gated FTH imaging i.e. by using a circularly polarized 
reference pulse. The position of the slit and its relation with the polarization 
of the OTR radiation pulse was already discussed when considering the 
FRODI setup (see Fig. 44). The polarization of the OTR radiation pulse will 
be selected by the slit orientation, and will always be linear independently 
of the orientation of the slit, because it is positioned in the spatial Fourier 
plane and it is centered around the optical axis. As a result, with a circularly 
polarized reference source we will always have the same optimal interfer- 
ence for any orientation of the slit (aside for a constant, unimportant phase 
in the hologram). 

This technique has the same advantages of holographic techniques. The only 
disadvantage is constituted by the need of additional hardware (reference 
laser source), which is common to all frequency-gated FTH techniques. 
However, no precise synchronization is needed: sub-ps accuracy will suffice. 
In fact, the only important point is that temporal separation will obey the 
temporal equivalent of the FTH condition, also accounting for jitter. For the 
holographic technique to work, it does not matter if the temporal separation 
fluctuates from shot to shot. 

Similarly to FRODI, this multishot geometry can be rearranged in a single 
shot geometry. Several detection stations can be arranged to simultaneously 
measure few projections of the charge density distribution. Moreover, as for 
FRODI too, there is no need for a-priori information from FROG measure- 
ments. The advantage with respect to FRODI is that no ITA is needed, which 
will substantially speed up the reconstruction process. 



7.3.3 Self-referencing measurements in spatio-temporal FTH 

In the previous Section we discussed spatio-temporal FTH. This approach 
requires a well-characterized reference pulse. In this Section we restrict 
our attention on self-referencing spatio-temporal FTH, in which the Optical 
Replica pulse is used to measure itself. This allows one to reconstruct the 
3D structure of the electron bunch without the need for a reference laser 
source. 



In particular, in this Section we discuss a method based on 2D shearing inter- 
ferometryiil [|63l] in the spatio-temporal frequency domain {co, u^). Shearing 



There is no important physical difference between interferometry and hologra- 
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Fig. 53. Self-referencing spatio-temporal FTH concept: BS, beam splitter; M, mirror. 
A Michelson interferometer provides independent control of the spatial shear and 
delay between two copies of the input coherent OTR pulse. 

interferometry is a well-known technique for the measurement of wave- 
fronts in the spatial domain. It operates by having the wavefront with 
phase function (p{x,y) interfering with a test-replica of itself shifted (lat- 
erally sheared) along the x axis by a distance Ax. Thus, at a given spatial 
point (x, y) on the detector, the phase difference (f){x + Ax, y) - (f){x, y) can 
be extracted from the measured interferogram. If Ax is sufficiently small, 
such phase difference is proportional to the gradient d(p/{dx). The unknown 
spatial phase (p{x, y) can then be obtained (up to an additional constant) by 
integration. 

There is a well-known one-dimensional version of this technique to charac- 
terize ultrashort laser pulses, which is called Spectral Phase Interferometry 
of the Direct Electric-field Reconstruction (SPIDER) [64]. The SPIDER tech- 
nique uses the principle of spectral shearing interferometry to measure the 
spectral phase. Typically, SPIDER is based on the interference of two time- 
delayed replicas of the input pulse, which are also frequency-shifted by Ao; 
with the help of non-linear optical methods. The spectral gradient d(p/{dco) 
of the phase (p{co) is then recovered from the one-dimensional interfero- 
gram. Further integration yields back the phase (p{co). A one-dimensional 
shearing-interferometry setup in the temporal-frequency domain is more 
complicated than its spatial counterpart, because it requires non-linear op- 
tics to generate the spectral shear. However, the SPIDER technique can be 
extended to allow for the reconstruction of the spatio-temporal phase from 
a measurement of the spatial gradient of the phase [|63i] . 



phy. 
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In our case of interest, it is possible to completely characterize a slice of 
electric field E{(jo, it) as a function of cu and e.g. Ux for a given Uy = 0, and 
subsequently scan the angle of the slice in the (Ux, Uy) plane, corresponding 
to the {Xf,x/f) plane, i.e. the Fourier plane. We refer to the scheme in Fig. 53. 
The phase (p{co, Ux, 0) can be measured with a spatial shearing interferometer, 
which generates two copies of the input field, one of which is spatially 
sheared by Au^ with respect to the other, and with relative spectral phase 
COT. The field at a given plane is imaged through the two arms of a Michelson 
interferometer onto the slit of the imaging spectrometer, which is equipped 
with a dispersive element, so that the imaged intensity is given by 



I{co, Ux) oc \e{co, Ux, 0) + E{co, Ux + Aux, 0) exp[za;T]| . (115) 

If the fringe period due to the reference phase cot is sufficiently small, the 
interferometric component E*{co,Ux + Aux,0)E{co,Ux,0)exp[icoT] can be ex- 
tracted with the help of Fourier processing techniques. This can be obtained 
by using a temporal delay t larger than a few times the duration of the 
Optical Replica pulse. The argument of the interferometric component is 



d(h 

(p{cO, Ux + AUx, 0) - (p{cO, Ux, 0) = AUx-^ — {cO, Ux, 0) + CUT . (116) 



Through the settings of the Michelson interferometer, one can control the 
shear Aux and the delay t. After subtraction of the reference phase from 
the right hand side of Eq. (116), one obtains the frequency-resolved spatial- 
gradient of the phase d(p/{dUx){co,Ux,0), which can be then integrated to 
give the phase cp{co,Ux,0) + (f)Q{co), where (f)Q{co) is an arbitrary function of 
the frequency only. This function can be eliminated by renormalizing the 
phase at each co for the fixed spatial frequency Ux = and My = to the 
phase obtained from the FROG measurement of the peak-current profile in 
the ORS undulator setup. In fact. 



(pvROcico) = cpico, 0, 0) + (Poico) . (117) 

The technique that we just described shares the same advantage of the 
spatio-temporal FTH approach discussed in Section 7.3.2 and, additionally, 
it does not need a reference laser source. The only disadvantage is consti- 
tuted by the need for a-priori information from FROG measurements. 



Ill 



current / kA 




Fig. 54. Peak-current profile at the entrance of the SASE undulator (after XFEL 
TDR). 

7.3.4 Time-gated Fourier-Transform Holography 

Up to now we introduced 3D FTH techniques, which operate in the Fourier 
domain. These techniques require no special synchronizatioriT^ This is one 
of their advantages. Here we will introduce time-resolved FTH techniques 
operating in real space-time domain, which are based on the fact that holo- 
grams record information about the object only under simultaneous illumi- 



nation by a coherent reference wave |16|1 



A laser capable of producing single pulses much shorter than the elec- 
tron bunch is needed for time-resolved imaging. Reference lasers with 10-fs 
pulse duration are available. One of the main technical problems for time- 
resolved holography is the synchronization between reference and optical 
pulses. Here we propose a way of getting around this obstacle by shift- 
ing the attention from the problem of synchronization to the problem of 
measurement of the slice peak-current. 

Consider Fig. 54, which shows the foreseen longitudinal bunch profile (cited 
from [3]). Due to the strong non-linear dependence of the FEL process on 
the peak-current, only the central part is suitable for lasing. 

It is therefore of special interest to develop techniques capable of imaging 
the slice with maximal peak-current, with a typical width of about 3 [im (or 
10 fs). The setup of the proposed time-gated FTH technique is shown in Fig. 
55. 



•^'^ Sub-picosecond synchronization is still required in our proposed scheme, but it 
may be relaxed using a longer laser pulse, which poses more stringent requirements 
for the average power of the laser beam. 
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Fig. 55. Time-gated FTH setup for imaging the slice of electron bunch with maximal 
peak-current. Since the integral of the image of the electron bunch density distri- 
bution is known, the slice with maximal peak-current can be uniquely determined 
even when time jitter is present. 
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Fig. 56. Sketch of principle for time-gated FTH-imaging of the bunch slice with 
maximal peak-current. 



The method is based, as before, on the Fourier Transform holography. The 
idea is to use a very short reference laser pulse of the order of 10 fs, i.e. 
the slice longitudinal dimension, in order to obtain contributions to the 
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Fig. 57. Setup for recording image of bunch slice with maximal peak-current using 
ultrashort seed laser pulse for optical replica synthesizer. 

interference terms in the hologram from a single slic^ within the electron 
bunch. 

In contrast to the frequency-gated FTH, no spectral filter is use so that 
the reference and the optical replica are not easily synchronized in the holo- 
gram plane. In fact, both optical replica and reference laser pulse are subject 
to time jitter in the order of 100 -r 200 fs, and sub-picosecond synchronization 
does not guarantee that the reference pulse overlaps with the maximum of 
the optical replica pulse in the hologram plane. In order to solve this prob- 
lem one needs to discriminate the slice with maximal peak-current within 
the electron bunch from the rest of the beam. 

The duration Xref - 10 fs of the reference pulse is relatively long due to the 
applicability of the adiabatic approximation, as cXref/A ~ 30 at our selected 
optical wavelength, and it is relatively short compared to the electron bunch 
duration, as Xp/xref ~ 20. 

A method for imaging the bunch slice with maximal peak-current is il- 
lustrated in Fig. 56. The hologram records information about the object 
only when it is illuminated with the coherent reference wave. One of the 



^ Note that the actual thickness of the slice can be arbitrary, from 3fim up to the 
limit for projected emittance, depending on the reference pulse length. 
•^^ The Fourier masks are present instead. However, their presence (in all schemes 
starting with those in Section 6) can be replaced by numerical image processing 
before reconstruction. 
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Fig. 58. Simple method for solution of the double-valued (peak-current) function 
problem in multi-shot 3D electron bunch FTH imaging. The reference laser pulse 
is split into two parts, where one is delayed with respect to the other of about half 
electron bunch duration. Both pulses are passed through different quarter wave 
plates and as a result have different chirality. The pulses are recombined before 
being sent onto the focusing optics (see Fig. 55). 

main technical problems for time resolved holography techniques is the 
relative synchronization of the reference laser pulse and the optical replica 
pulse from the OTR screen. In fact, both optical replica and reference laser 
pulses are subject to time jitter. As mentioned above, we shift the attention 
from the problem of synchronization to the problem of measurement of 
the peak-current of the slice. If a relative peak-current is known for each 
slice, a sorting of results according to peak-current measurements uniquely 
gives the slice with maximal peak-current without knowledge of the delay 
between reference and object pulses. 

Another technique closely related to the time-gated method for imaging of 
the electron bunch with maximal peak-current is shown in Fig. 57. The idea is 
to use a very short (order of 10 fs) reference laser pulse in order to obtain the 
contribution from a single slice within the electron bunch (see [10, 11]). As 
before, the arrival times between the laser pulse and electron bunches jitter 
from shot to shot. In the case of Fig. 57, we have the additional possibility 
to control the slice peak-current using a bolometer, i.e. measuring the total 
charge in the modulated slice on a shot-to-shot basis. Subsequently, the 
acquisition system will discriminate the slice with maximal peak-current. 
In this case, angular filtering of coherent OTR is needed before the bolometer 
in order to select the halo part, whose energy-per-pulse does not depend on 
the transverse size of the electron bunch. 



The time-gated FTH technique for imaging the electron-bunch slice with 
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Fig. 59. Schematic representation of 3D multi-shot measurements in a time-resolved 
FTH setup. Left: time diagram of the reference wave. Right: temporal profiles as 
they appear in the hologram plane. The new method attempts to get around the 
jitter obstacle, by measuring the slice peak-current. If the relative peak-current is 
known for each slice, sorting the results according to the vortex sign at the central 
(low spatial frequency) region of the hologram uniquely gives the position of the 
slice within the electron bunch. 



maximal peak-current can be extended to a scheme to provide the 3D image 
of the electron bunch in a multi-shot geometry. The idea exploits the jitter 
between the reference (laser) pulse and the electron bunch, i.e. the optical 
replica pulse. In fact, due to the presence of jitter, not only the slice with 
maximal peak-current will be recorded. The main problem is to discriminate, 
shot by shot, the longitudinal position of the slice in order to reconstruct the 
3D image of the electron bunch. One can measure the peak-current of the 
slice as discussed before, with reference to Fig. 55, but there is always an 
ambiguity on whether a slice with a given peak-current is located before or 
after the slice with maximal peak-current. In other words, with the exception 
of the slice with maximal peak-current, we have a difficulty with a double- 
valued function (peak-current as a function of position) for other peak- 
current values. 



Here we propose a solution to this problem with the help of a double 
reference pulse, which may be produced as shown in Fig. 58. A 10-fs linearly 
polarized laser pulse is first split into two identical pulses. The polarization 
of these pulses is changed from linear to circular with the help of two 
different quarter-wave plates, emerging with different chirality. Finally, one 
of the two pulses passes through a delay stage of about half electron bunch 
duration or more before being sent to the focusing optics of the FTH setup 
(see Fig. 55), so that one obtains two 10-fs reference pulses with opposite 
chirality delayed of about 100 fs as shown in Fig. 59), left. 
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(a) (b) (c) 




Fig. 60. Holographic patterns corresponding to (a) chirality +1 of the reference, (b) 
chirality -1 of the reference and (c) sum of the previous two cases. 

We should now be able to distinguish among the three situations shown on 
the right of Fig. 59. When this is accomplished, the 3D reconstruction can 
be successfully performed. Due to the particular delay choice, there are no 
other beam geometries to be studied. The idea is to distinguish among the 
three situations on the right of Fig. 59 by studying the low-frequency part of 
the hologram, related to the halo regions C and D in Fig. 7. In these regions, 
the hologram shape will not depend on the transverse electron density 
distribution. Use of numerical post-processing deconvolution (instead of 
using two masks as in Fig. 55) will serve our purpose here. 

The shape of such kind of holograms "is characterized by nearly paral- 
lel" fringes, "except for a forking pattern in the vicinity of the core" [65] , 
as shown in the example of Fig. 60. For chirality -i-l (bottom case on the 
right of Fig. 59, corresponding to case (a) in Fig. 60) the forking pattern will 
be directed upwards, for chirality -1 (upper case on the right of Fig. 59, 
corresponding to case (b) in Fig. 60), downwards. When we have simul- 
taneous overlapping of the two reference pulses with the Optical replica 
pulse (center case on the right of Fig. 59, corresponding to case (c) in Fig. 60) 
we have a combination of two forking patterns one upwards and the other 
downwards. 

It remains to be discussed whether the separation of the holographic fringes 
is enough to guarantee distinguishable patterns. The separation between 
the fringes depends on the transverse offset of the reference source relative 
to the object. When the offset of the reference source increases up to about 
3o, o being the rms transverse electron bunch dimension, the fringes will be 
separated of a distance proportional to l/3cr, which is much larger compared 
to 1/ (yK). As a result, for the first fringe we will obtain a fork which is well 
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distinguishable, meaning that it does not overlap with the bunch structure 
and it is not influenced by the structure of the single electron pattern. 



We can speculate that computerized analysis will easily distinguish among 
these three kinds of events, solving the problem of identifying the position 
of the slice within the bunch. There are many advantages related to the 
application of this technique. In fact, it avoids complications like the need 
for ITA, it does not require a-priori information about the electron bunch 
structure (i.e. can be used to measure ultra-short bunches) and it does not 
require synchronization. 



7.4 HOTRI technique to provide a full 3D image of individual, arbitrary electron 
bunches 



In this Section we introduce a novel technique to characterize the 3D struc- 
ture of the electron bunch. This technique combines multiple-reference FTH 
(Section 7.2.2) and time-gated FTH (Section 7.3.4) methods. The fact that 
multiple-reference FTH may be naturally used for time-resolved experi- 
ments was first discussed in a concept presented in |^66ll. 

Multiple-reference FTH can allow us to simultaneously record images of 
different slices of the electron bunch, thus providing a 3D image of a single 
electron bunch. Our idea is to vary the time-delay of the reference sources. 
Each reference source is a small disk with high reflectivity, and the distance 
from the reference scatter to the membrane is varied in order to create a 
time-delay between reference wave and OTR object wave. Fig. 61, left. As a 
result, once the hologram is processed, different reconstructed object images 
will correspond to different slices with given time-delays along the electron 
bunch. Fig. 61, right. 

From the measurement using the setup shown in Fig. 62, we can make a 
movie (five frames with 10 fs exposure-time and 50 THz frame-rate) of the 
electron bunch propagating through the OTR screen, showing the electron 
density distribution against space and time. We named this novel technique 
Holography Optical Time Resolved Imaging (HOTRI). 



8 Sensitivity to the energy chirp of the electron bunch 



As we mentioned above, some coherent imaging techniques take advantage 
of the fact that the test pulse is bandwidth-limited. In this Section we will 
discuss what conditions should be met for the optical replica pulse to be 
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Fig. 61. Membrane for time-resolved FTH with reference scatters (left) and recon- 
structed holographic images of electron bunch slices (right). 
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Fig. 62. 3D electron bunch structure monitor by use of the HOTRI technique. 

considered bandwidth-limited, and what is the impact of a phase-chirp on 
the different techniques proposed in this work. 

In Section 2 we discussed the impact of self-interactions on the optical 
replica. We demonstrated that, in practical cases of interest, self -interactions 
effects and energy-spread influence can be neglected, and the amplitude of 
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Fig. 63. The foreseen longitudinal phase-space distribution of electrons after the 
magnetic compressor BC2 at the European XFEL, extracted from 1,671 . 



the bunching can be considered uniform along the bunch. This is sufficient 
to produce an optical replica of the electron bunch, that is a radiation pulse 
whose electric field amplitude is a replica of the charge density distribution 
of the electron bunch. The optical replica pulse can be used to characterize 
the longitudinal bunch profile of the electron bunch [8] with the FROG 
technique. 



It should be remarked, however, that the bunch compression procedure 
introduces an energy chirp in the electron beam after the magnetic chicane 
BC2 in the case of the European XFEL, Fig. 5. This chirp introduces, as it will 
be discussed below, a distortion into the phase of the bunching which is not 
important for the ORS setup as concerns peak-current measurements, but 
is important when dealing with electron bunch imaging. A comprehensive 
analysis of the evolution of the longitudinal phase space after compression, 
including nearly all important physical effects (space charge interactions. 
Coherent Synchrotron Radiation (CSR), shape variation, resistive walls) is 
given in |67|1 . The longitudinal phase-space of the electron bunch just after 
the magnetic compressor BC2, taken from |j6^ is reproduced in Fig. 63. 



The energy chirp shown in Fig. 63 is due to a combination of different 
effects. In fact, energy chirp is intrinsically introduced in the framework of 
a single-particle dynamics during the bunch compression procedure, but 
it is modified by the presence of other wakefields. Moreover, as noted in 
[|67tl , the bunch formation system is composed by several chicanes, which 
complicates furthermore the situation. A qualitatively similar conclusion 
can be found in simulations for LCLS (see the study on longitudinal phase- 
space in PJ). 

The measurement of the longitudinal bunch profile by FROG is insensitive 
to the presence of the energy chirp and, better, as shown in |68|l , the ORS can 
be used to measure the energy chirp. In fact, a FROG device can measure the 
phase of the optical replica, and the phase is strongly related to the energy 
chirp by 
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64> = . (118) 

Km y 

However, the phase chirp of the optical replica pulse is, in its turn, related 
to whether or not the optical replica pulse is bandwidth limited. 

For a bandwidth-limited pulse of duration Xp and bandwidth Acu one has 
AcuTp ~ 1. This implies that Aco/co ~ (cuTp)"^. Also, for a bandwidth-limited 
pulse, one should have a constant phase i.e., from a practical viewpoint, a 
phase-shift smaller than a radian along the pulse. A phase-chirp linear in 
time corresponds to a shift in the carrier frequency. One can account for such 
a chirp by introducing an effective carrier frequency. Thus, we will neglect 
the linear phase-chirp and account for the non-linear phase-chirp only. We 
may restate the condition above, requiring that the non-linear chirp should 
be sufficiently small, and the phase distortion across the pulse should be 
smaller than a radian. 

Here we consider the low energy case of 2 GeV after the second bunch 
compressor, where the longitudinal phase-space is depicted in Fig. 63. The 
value of the momentum compaction factor is that of the ORS dispersion 
section, i.e. J^se ^ 50 /im. With the help of Eq. (118), one can see that a phase 
shift 6(p of a radian corresponds to an energy shift of about Ay ~ 5 MeV 
along the pulse. Considering the European XFEL case, one should extract 
from the large linear chirp Fig. 63, the non-linear energy chirp component. 
A quick estimation shows that the non-linear component of the chirp is in 
the order of 2 -r 3 MeV per bunch length, yielding a phase shift smaller than 
a radian. There is some room to optimize the situation by increasing the 
energy modulation introduced by the seed laser, increasing the energy in 
the seed laser pulse and subsequently decreasing the R56 parameter in Eq. 
(118) of a factor 2 ^ 3. 

It should be noted that the large chirp shown in Fig. 63 is introduced ad hoc 
at the European XFEL (and at LCLS too) in order to diminish CSR effects 
in the chicane. Additionally this also allows to compensate for the linear 
component of the energy chirp introduced by wakefields after the bunch 
formation system, which has opposite slope. In several situations of interest, 
the non-linear component of the chirp can be smaller at the end of the bunch 
formation system. 

Having said this, it is interesting to discuss which techniques presented 
in this paper are influenced by the energy chirp of the electron beam, and 
which are not. 

First, let us consider methods that are not influenced. As we have seen in 
Section 5, direct (coherent) imaging without spectral filter allows one to 
record the integral of the squared modulus of the optical replica pulse, i.e. 
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a quadratic projection. In this case, any phase deviation along the optical 
replica pulse is irrelevant. In fact, energy chirp only introduces a phase per- 
turbation which depends on time and not on transverse coordinates, at least 
within a single-particle dynamics approximation in the bunch-formation 
system. 

Also techniques based on the a-priori use of information from independent 
FROG measurements work, without modification, with and without chirp. 
This is the case when we combine real and reciprocal space spectrometers. 
Fig. 40, and when we consider frequency-gated FTH, Fig. 51. Other tech- 
niques which are completely insensitive to energy chirp are spatio-temporal 
FTH, Fig. 52 and HOTRI, Fig. 62 and, more in general, all time-gated FTH 
techniques. This follows from the short duration of the reference pulse in all 
these cases, which is 10 f s long and has a bandwidth of about 100 nm. In fact, 
the non-linear energy chirp necessary to significantly perturb the operation 
of time-resolved techniques based on such kind of reference pulses can be 
estimated to be about 100 MeV, which is far from the design parameters of 
the European XFEL and LCLS. As a result, direct (coherent) imaging with- 
out spectral filter, techniques based on independent FROG measurements, 
the spatio-temporal FTH method and all time-gated FTH techniques are 
insensitive to energy chirp. 

Second, there are techniques that are influenced by the presence of energy 
chirp, but that may be easily modified to account for such presence. These 
are schemes operating in the frequency domain in the 3D imaging mode, 
i.e. the multishot measurement method based on the use of the projection 
algorithm. Fig. 41, and FRODI, Fig. 42. In order to account for the effects of 
the energy chirp, in these cases we can make use of the knowledge about the 
phase deviation along the optical replica pulse as measured by FROG. This 
requires modification of the technique, because it implies an extra FROG 
measurement. Note that the required information is automatically available 
if we perform peak-current profile measurements. 

Once the phase along the optical replica pulse is known, we can use it 
e.g. in the reconstruction of 2D traces for FRODI. Consider, for the sake 
of illustration, the (x, t) projection. If we assume that the optical pulse is 
bandwidth-limited, its phase is constant, and po and Uf are real and positive 
(see Eq. (78)), i.e. one can talk about the slowly varying real and positive 
envelope of the puis If, instead, the optical pulse is not bandwidth 
limited, one has fl^ = exp[z(/)(i)] with constant and (p{t) a non-zero phase 
due to energy chirp within the electron bunch. 



Note that poiAco, r) is still a complex-valued function of Aco. Also note that in 
this case, po{0,r) = fl// po{t,r)dt. In other words po at Aco = coincides with the 
temporal projection of po- 
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Therefore, when the energy chirp is negligible, the envelope of the optical 
replica pulse is an exact replica of po{t,x,y), i.e. of the electron density 
distribution, which is real and nonnegative. As seen before, this is enough 
to guarantee unique reconstruction of the object. The fact that the optical 
replica pulse envelope is real and nonnegative actually amounts to complete 
information about the phase of the pulse envelope, which is just constant. 
When non-negligible energy chirp is present, the pulse envelope is no more 
real, but the phase along the t axis is nevertheless fully determined by the 
FROG measurements, and this is enough to guarantee reconstruction. This 
allows to reconstruct any arbitrary projection (x', t), where x' is rotated of an 
arbitrary angle with respect to x in the {x, y) plane, but not the (x, y) trace. 
Anyway, if a sufficient number of (x', t) traces are known, this information 
is enough for reconstruction. Note that the 3D optical replica pulse in the 
space-time domain can now be reconstructed as a cube of complex-valued 
data. The amplitude is proportional to po(^ ^/ J/)- 

Finally, third, all other techniques presented in this work, including direct 
(coherent) imaging with spectral filter. Fig. 27, linear projection diffraction 
imaging. Fig. 33, and FTH imaging of linear projection. Fig. 46, can operate 
only with a bandwidth-limited optical replica pulse. Note that when we 
extend these techniques to 3D reconstruction methods we can easily over- 
come the energy-chirp problem. However, when we deal with single (x, y) 
projection measurement only, like in Fig. 27, Fig. 33 and Fig. 46, this is not 
possible anymore and one should take care that the energy chirp is actually 
negligible. 



9 Conclusions 

The study presented in this work opens up a novel field in high-energy, 
ultrashort electron beam diagnostic methods, based on the use of coherent 
Optical Transition Radiation (OTR) for 3D imaging of an ultrashort electron 
bunch by means of the Optical Replica Synthesizer (ORS) |i8J. 

The ORS technique was proposed as an attempt to solve a very challenging 
problem, namely the longitudinal electron bunch diagnostics, by producing 
a ID optical replica radiation pulse, thus reducing the electron bunch char- 
acterization problem to the ID optical pulse characterization problem. The 
ORS performs longitudinal diagnostics of ultrarelativistic, ultrashort elec- 
tron bunches in two steps. First, an optical replica of the electron bunch is 
prepared. Second, ID characterization of the electron bunch is performed by 
a femtosecond oscilloscope. For this purpose the ORS setup makes use of a 
commercially available Frequency-Resolved Optical Gating (FROG) device. 
The availability of methods and instrumentation for measuring ultrashort 
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optical pulses constitutes the main advantage of using an optical replica 
compared to other diagnostics schemes. 

In the case of 3D characterization of an electron bunch we demonstrated 
that the same approach can be realized. In other words, the 3D imaging 
of an electron bunch with spatio-temporal coupling can be reduced to 3D 
characterization of an optical pulse with spatio-temporal coupling. How- 
ever, the 3D problem has a different status compared to its ID counterpart. 
In fact, the 3D problem was formulated only recently, and no commercial 
devices are yet available. This does not mean that the schemes discussed 
in this work constitute a challenge as concerns their technical realization. 
The 3D problem has a number of possible solutions, and this explains why 
some of the techniques proposed here (FRODI, HOTRI) are completely orig- 
inal. Note that our formulation of the 3D problem is less general than the 
one addressed by the optical community. In our case of interest, the optical 
pulse is not arbitrary, but being the optical replica of an electron bunch, has 
a duration of about 100 fs and has specific constraints like the bandwidth 
limit, or in more general cases, a phase which is a function of time only. 
This opens up the possibility to use some approaches, as Fourier Transform 
Holography (FTH) with reference source and its variations, which cannot 
be applied for an arbitrary pulse. 

We proposed to use coherent OTR as the source of radiation for the optical 
replica pulse. In the case of coherent OTR we can take advantage of the large 
number of photons and of the coherent properties of the radiation pulse. 
We developed schemes based on direct coherent OTR imaging, diffractive 
imaging, FTH, and a combination of these techniques. In particular, novel 
techniques like FRODI and HOTRI allow for the 3D characterization of 
ultrashort electron bunches by use of single and multishot measurements. 

We proposed to exploit the highly-developed software algorithms for Diffrac- 
tive Imaging and Fourier Transform Holography Reconstruction of 3D im- 
ages in real space-time domain can be based on methods like the Gerchberg- 
Saxton algorithm, the Ftneup algorithm or their generalizations. Moreover, 
similarly to the ORS setup, we proposed to make use of the large selection of 
commercially available instrumentation for optical pulse diagnostics. Per- 
forming diagnostics in the range of visible optics will greatly simplify a 
choice of optical elements or their manufacturing. Also, ultrashort, up to 
10 fs, optical laser sources with any state of polarization are available as a 
reference sources. The same approach, to use already highly-developed de- 
vices, is proposed when considering 3D-imaging methods in the 3D Fourier 
domain. For this kind of applications there is a need for optical systems 
capable of forming high-resolution images at different wavelengths. Imag- 
ing spectrometers are instruments capable of providing this functionality. 
There are two types of imaging spectrometers. One type is based on the 
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use of a plurality of bandpass filters to measure a 3D data cube in the 3D 
Fourier domain for different frequencies. A second type is based on the use 
of a dispersive element, and it is particularly suitable for single shot 3D 
measurements. 

Due to the users' needs, future XFELs will operate with shorter and shorter 
electron bunches. Our proposed diagnostic techniques have the potential for 
extensions from the 100-fs time scale, which we discussed here, to the 10-fs 
time scale of electron bunch duration by straightforward rescaling to shorter 
wavelengths. Exploitation of shorter wavelengths is possible because our 
setups are based on linear optical elements only. The shortest possible wave- 
length compatible with glass optics is about 200 nm, which corresponds to 
the fourth harmonic of a Ti:Sa laser. Using particular geometries, FROG 
devices can operate in this range as well [9J. 

We regard the concepts presented here, based on the combination of ORS 
with coherent OTR imager setups, as the start of a novel direction in diag- 
nostic methods. Therefore, the present work should not be considered as 
comprehensive of all foreseeable applications. On the contrary, we hope that 
our work will stimulate interest and open the door to many new possibilities 
for ultra-fast electron bunch diagnostics. 
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